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A computer program is described here which will do an 
in-depth grading analysis of a student's attempt to produce a 
required German sentence. The grading program is. given a description 
of the expected response and the sentence actually constructed by the 
student^ the two are compared, and a complete diagnostic report of 
the rcorrectness of the grammar of the student's response is provided 
for the teacher. The primary limitations on the program at this point 
are that it^handles only grammar and not meaning, it can process only 
a subset of the German language, and the grading program has only 
been operated in isolation thus far and has not been included in a 
teaching environment. (Author/RH) 
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Chapter One 
INIRODLCTICW 

This dissertation is motivated by a concern for an 
essential aji^pect oi' te^chinc., particularly of teaching by 
cojiiputer: the nc-cessitY of c^radinc the students response 
to a question jXDsed by the teacher- By grs.din£:, I mean 
analyzing for correctness, diao^osing what has been done 
both correctly and inccri^ectly • My v;ork concentrates on 
language teaching, specifically the teaching ol" GeriT;an 
graiTiFtar, and works at analyzing .sentence-sized responses* 
In the restricted .area of j^ttern practice and controlled 
translation, in which the general form and content of the 
sentence are specified as pairt of the rules of the gamej I 
have produced a computer prograrr: which can do an in-depth 
grading analysis. I v/ill not ixresunie to claim to replace a 
huiTian teacher, but I do assert that a substantial step has 
been taken towards duplicating many of the. strong j)oints of 
a good t'eacher's grammatical analysis* 

A few caveats: 

1 ♦ My work deals primarily with the grammar of 
a language, not language as -a means of communicating 
ideas. It is intended for an environment where a sentence 
like "I ate the 'books" may re^jresent a less serious error 
than "I ate them eggs." 
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2. This research does not represent an exhaustive 
treatment of the German langua^^e. It is rather intended as 
a demonstration of how grading might be done; to tliat end 
some features are handled very extensively, while others 
have so far received very little attention. 

3. The grading prograjT: its intended for operation in 
a teaching environment, of course, I have not yet 
constructed such an environment. I merely assume that the 
student v;ill be asked, in some way, to produce a specific 
German sentence, and worry about its analysis. 

Operation: 

The grading program is given tv;o inputs, a 
description of the expected response and the sentence 
actually constructed by the student. Using its built-in 
knowledge of German grammar, it analyzes the student 
sentence for conformity with both the expected response 
and the applicable rules of German grammar, and produces 
a comprehensive diagnostic report. (This report is intended 
for a teacher, not directly for the student. ) An example 
of the program in operation may be helpful at this point. 

The expected sentence is "Jetzt will er dem 
Maedchen die Tuer aufmachen" (*'Now he. v;ill open the door 
for the girl")- In the first example, the student has 
responded correctly. In the second, a large number of 
errors are present, including a change to a first person 



subject. I have api>ended a few explanatory notes, set off 
in square brackets, to facilitate understanding wiiat is 
goine on; a fuller description can be found in several 
suceeding sections of this dissertation. 



RUN 1 



Description generated by human expert: 
(STMT (VERB WOLLEN (AUI^ • J.^ACHEN); 
(SJ ER) 

(OJ ACC EIE TUER) 

(OJ DAT DEf^ liAEDCHEK) 

(PREDMOD (ADVERB JETZT))) 

[This is the description of the expected 
sentence, containing both its words and 
structure. To wit, a statement (STMT) 
with two verbs ("wollen" & "auf machen" ) ; 
subject a pronoun '"er" (SJ ER); an 
accusative object (OJ ACC) consisting of 
the noun phrase "die Tuer" ; a dative 
object; and finally, a predicate-modifier 
in the form of the adverb '^jetzt".] 

Sentence input by student 

JEIZT WILL ER DEM MAEDCHEN DIE TUER AUIl^iACKEJ] 

[The student's actual input, which will 
be the subject of the following analysis*] 

NOW PROCEEDING WITH ANALYSIS 



I^The first report will be from the 
inflectional analysis of the dative 
object "dem Maedchen'**] 

. . . SWiMARY REPORT FOR NOUN PHRASE INFLECTION 
ON DEM MAEDCHEN 

Best we can do guessing intent is: (DAT S (M N) ) 
CONFIDENCE LEVEL: VERYHIGH 

[The inflectional analysis looks at what 
the student actually did and tries to 
find an interpretation for the particular 
inflectional forms* ••in this case, it 
feels that the pair "dem Maedchen" 
reflects an inflection for dative, 
singular, masculine or neuter.] 
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GENDEI? REPORT PGR hOUllz MAEDClriEi^i :: WEAK CORRECT 

e;cpected n , coull be (m jj) 

[A report coxsparing the above conclusiori 
with what is expected or correct.] 

CASE REPORl POR OBJECT: DEK MALJ;CHEI^ zz 

STRONG CORRECT = DAT 

• SUl'.MARY REPORT FOR 1^'OL.N PHTxASE INFLECTION ON DIE TUER 

Best we caii do cuessin£ intent is: ((NOM ACC-)-- S F) 
XONriDEI^XE LEVEL: VERYHIGH 

[The coinputer's informed ^uess, as 
before. The "confidence level" refers 
to the ^ues£i, not to the state of the 
student s knowledge.] 

GEI^DER REPORT FOR NOUN: TUER :: 

STRONG CORRECT = F 

CASE REPORT' FOR OBJECT: DIE TUER : WEAK CORRECT 

EXPECTJ^D ACC , COULD BE (NOM ACC) 

CAGE REPORT FOR SUBJECT^: ER : : 
STRONG CORRECT = NOW ■ 

VERB ENDING CHECK .•OK! 

[This is a check on the forn: "will" of 
the main (or "finite" ) verb "wollen". 
It appears this late because the progi^ain 
had to first find the subject before 
it could check subject-verb a£3:eeinent. ] 

Phrase division done, sentence type report: 

-Final verdict is.FV-2 , level of confidence: VERYHIGH 
[This is the basic v;ord— order check 
reporting on its findings. "FV-2" 
means Finite Verb-Second, the required 
pattern for a statement in German. 
- "Confidence" a^in is an internal 
f performance assessment. ] 

Play by Play account of analysis: 

Due to: INITIAL HYPOTHESIS , FVT set to FV-.2 

Confidence in PVT setting is: VERYllIGH 

Sub mar y Comment's 

Clause type correct (FV-2) 

— EKD OF RUN — — — 
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^ ' ^RUN 2 

Description generated "by human expert: 
(STMT tVERB WOLLEN (AUP . HACHEN)) 
ASJ ER) 
'(OJ ACC LIE TUER) 

(OJ DAT DEt; K.^CHEN) .-^o-.^ 
(PREDMOD (ADV.SRB JETZT))) 

[Description is the same as for run 1.] 

Sentence input by student 

JETZT ICH WILL MACJIN DEN TUER AUF BER KAEDCHEIJ 

[The student sentence is quite different 
for this run. It represents an attempt 
at the same response, but with nany 
errors. ] 

NOW PROCEEDING WITH ANALYSIS 



... VERB MACJrOa^ . . APPEARS AS MACHN 

MISSING -E ON INFINITIV'E EI-JDING 

[The stem is **ii3ach-", and since an 
infinitive is required it should appear 
as "machen." There is no interpretation 
under which the "-n" is correct, so the 
program coiijplains about improper formation 
of the expected infinitive,] 



SUMARY REPORT FOR NOUN PHRASE INFLECTION ON DEK TUER 
Best we can do guessing intent is: (ACC S K) 

CONEIDENCE LEVjEL: HIGIi 

[Note that the confidence level is only 
"high" ... the construction is not 
correct, as "tuer" is a feminine noun^ 
The program has a hypothesis about v/hat 
the student intends; this is quite close 
to the expected values, so the program 
is reasonably confident of its guess.] 



• GENDER REPORT FOR NOUN: TUER :: INCORRECT 
Should be F is M 

[Eased on' the guess above 
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CASE REPORT FOR OBJECT : DEN TUEFi : : 
STRONG CORRECT = ACC 

[Again based on the hypothesis on what 
the student intended. Notice that the 
program has thus been able to establish 
correct performance lor case and identify 
specif il3'a£t^- an error in gender 
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SUMMARY REPOPT' NOUN PHRASE INFLECTION 

ON DEP ' 

Best we car itent is: (DAT S F) 

CONFIDENCi; J;; 

[vt2, j.ar comments apply here.] 

GENDER REPORT TOR NOUN: MAEDCHEN :: INCORRECT 
Should be N Is F 

^CASE REPORT FOR OBJECT: DER KAEDCHEN :: 
STRONG CORRECT = DAT 

PN SUBSTITUTION.. Should be (3 NGM S M PER) 
is ((WORD ICH (I C H)) : (1 NOM S * PER)) 

[At this point the prosram has sucessf ully 
coped with an error which resulted in 
a totally different form, the change 
from the expected "er" to "ich". The 
message gives tlie internal representation 
for the inflectional characteristics 
associated with the two forms, from which 
one can see that the only change is in 
I>erson, from third to first. ] 

CASE REPORT FOR SUBJECT: ICH :: 
STRONG CORRECT = NOM. 

VERB ENDING CHECK ..OK! 

[Even though the subject was not the 
expected one, the program can check 
verb agreement with what actually was 
present. [1 

Phrase division done, sentence type report : 
No confidence in sentence type, probably confused 
last straw was: Separable-prefix confusion 

[This sentence is pretty badly mangled, 
and the word- order check finally gives 
up trying to match It to a German 
pattern. ] 

Play by Play account of analysis: 

Due to: INITIAL HYPOTHESIS , FVT set to FV-2 

Confidence in FVT setting is: VERYHIGH 
[This is a standard beginning for the 
word— order analysis of a sentence: 
start in good faith, reduce the level " ^ 
of confidence as errors are observed. 
"FVT" refers to Finite Verb Type, the 
classification of the sentence (FV— 2).] 
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Due to: DI not separated 

Confidence in FVT setting is: HIGH 

[Oops, the dependent infinitive (DI) is 
positioned immediately after the finite 
verb, rather t^ian where it belonss at 
the end of the clause. Also, its 
separable prefix "auf" has been detached, 
an error for an infinitive ; more on that 
later on. ] 

Due to: TWO MEMBER FROKT FIELD- 

Confidence in PVT setting is: MED 

[The i^attern "FV-2" means that the finite 
verb must be the second element in the 
clause, but here it is preceeded by two 
elements, a definite syntax error* ] 

Due to: Possible English pattern in I • error 
Confidence in FVTsetting is: LOW 
[Furthermore, the pattern looks 
suspiciously like English* ] 

Due to: Separable— prefix confusion 

Confidence in FVT setting is: mm 

[As mentioned above, the prefix ought 
not to be separated from the infinitive 
••machen" ; also, the prefix is in a 
random location, not even at the end 
of the clause. At this point, the 
program gives up. ] 

other comments: 

[A rehash of some of the important 
findings which might be of interest 
even if the play— by— play analysis is 
not consiilted.] 
DI position error ^ not separated from FV 
Front field error (FV-2): 2 elements in F.F. 



- END OF RUK 



In constructing the grader, I have been guided 
primarily by the need to approach the problem via the rules 
and structure of German grammar. The use of general 
linguistic information about German is indeed central to 
my work. ^ The possibility of erroneous input seriously 
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limits the applicability of an analysis based solely on 
r-eneral linguistic knowledge: the ver^/ sxajiunatical 
constructions on which a c^neral parser depends are subject 
to distortion dur o'just the kind of student niistakes that 
the pro£xarr i.. des ned to analyze. I attempt, therefore, 
to strike a comproiriise between generality and diagnostic 
caj^ability. I supply a description of the expected 
sentence to aid the analytic routines, but have designed 
their to accept this description without being entirely 
bound to it letter by letter. Furthermore, the routines 
responsible for carrying out the various grammatical 
functions of the program are all constructed to deal with 
errors as a direct part of their functioning, not merely a>^ 
a recovery measure. 

The "correctness" of a student response in an 
instructional setting depends on a number of factors. 
Internal agreement rules are always considered, often 
subordinating the check on conformity with the expected 
answer. Partially correct answers are acknowledged as 
such; errors due to the propagation of misconceptions are 
so identified (rather than being counted as additional 
separate errors) ; plausible explanations are offered for 
many "incorrect" forms. 

The program is very flexible in its ability to 
follow the student thrcx;igh many kinds of distortion and 
error. It is precise, in reporting independent errors 
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independently* It is vigilant, insisting always on 
checking the most simple rules as well as the more complex 
ones. It is thoroiogh, gathering information from one 
response on a large number of different aspects of student 

^^orr e, through its ability to analyze an entire 
sentence. I believe that this grader will develop into an 
invaluable constituent of a computer teaching system which 
is truly able to adapt to the student. 

I regret that this project must be so intimately 
involved with German. Natural language being the complex 
entity that it is, one cannot exi)ect a simple theory to be 
at all sufficient. In the course of accommodating the 
special cases which together maJce up German grammar, I have 
evolved various general principles, which will be presented 
below. Some are specific to German, but many are of much 
broader applicability. 
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Chapter Two 
LITERATURE REVIEW 

t 

This dissertation involves analysis of the German 
language, and thus belongs in a general way to the set of 
projects concerned with the analysis of natural language. 
In its development to date, my project has involved 
relatively simple grammatical constructionis, placing few 
demands on the relatively modest parsing scheme I have 
employed. Certainly, modern work in linguistics has helped 
my project; in particular, the use of structural linguistics 
natural language grammar texts has been quite helpful. I 
have been able to use the grammatical schemata of the 
Stanford German text <ref. 10»1; Lohnes and Strothmann, 
lg68> extensively, with additional help from the 
Reference Grammar <ref. 9*1; Lederer, Schulz and Griesbach, 
1969> (see Appendix A for more details). The suitability of 
these grammars, combined with the current ^simplicity of the 
grammatical constructions handled by my program, has greatly 
lessened the need to tura to more formal grammatical 
techniques. 

In an important respect, my work stands apart 
from the other natural langiaage analysis projects. Despite 
a great diversity among these projects, none manifest an 
interest in analyzing and grading incorrectly formed 
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laneuage.* Whereas I take the meaning of the sentence for 
granted, they are primarily interested in obtaining that 
meaning as the output of their analysis. Nor b^ve I made 
much use of their theories of parsing, due mainly to the 
need to integrate error analysis procedures into every step 
of my program. 

As this project continues to grow in grammatical 
complexity, I believe I shall, "be able to continue building 
on the foundations I have laid. As the grammatical 
complexity increases, some of the more sophisticated 
techniques developed by ciorrent researchers may well be of 
help in this project. In particular, the techniques used by 
Winograd <ref. 22.1; 1971> seem rather promising (though 
their response to erroneous forms would have to be adapted 
to my needs) . 

Education, particularly Computer— Assisted 
Instruction ("CAI"), is the field closest to this project. 
My interests are in grading, with eventual applications to 
teaching. Most CAI projects are primarily concerned with 
the operational aspects of teaching, and have given little 
attention to elaborate grading techniques. Still, there are 
a jfew comments I should like to make^^ concerning certain 
salient points: 



* Not even those projects connected with Computer— Assisted 
Instruction. 

o ... 
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— techniques of response recognition, particularly for 

large, structured answers; 

— error analysis, providing interpretations for specific 

kinds of wrong answers. 

I have consciously incorporated a substantial 
amount of knowledge of German grammar into the computer 
program. In so doing, I join a growing number of projects, 
chiefly in Artificial Intelligence, which believe that the 
computer must know what it is doing if it is to function 
intelligently-* I am convinced that Computer— Assisted 
Instruction can approach the level of a "good" human teacher 
only when the computer "knows what it is doing". We expect 
our human teachers to be trained in their subject, and look 
down on those v;ho are one lesson ahead of their class. The 
computer will need at least as good credentials. 

Traditional CAI projects generally do not attempt to 
have the computer "understand" the subject domain. These 
programs do, perforce, analyze answers, and sometimes draw 
conclusions from that analysis about what the student had in 
mind. A brief look may be of interest. 

A large amount of CAI material is built around a 
fixed recognition scheme: the student responds in a 
multiple— choice mode, or in what I term "hidden 
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* See, especially, Winograd, who makes this point quite 
eloquently for understanding natural language. <ref . 22. 2> 
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multiple-choice*' in which his supposedly free response is 
matched against pre-specif ied alternatives. These schemes 
are attractive in allowing the instructor the freedom to 
associate arbitrariH.y complex in LerpretaLj.ons to specific 
be}r:avior. 

Por example, consider s simple exercise: 

PRESENT TO SlEDENT?: "1 see the hook" 

PRESENT TO SSHDENTr "Ich sehe Buch" 

REQUEST AMSMER 

CORRECT ANSWER.: "das"; (TYPE "correct", PROCEED) 

INCORRECT MfiWjER: "die"; '"V 

TYPE "^'the noun 'Buch' is roeuter, not feminine" 

INCORRECT ANSWiU : ' "*der" ; 

TYPE "the noun 'Buch' is neuter, not masculine" 

The ^eneusl failure tcis search out and use error 
iiittterpretations may lae due in part to. the nonautoma.tic 
nature of the tool pxrnjvided to the ILesson programmer: each 
scnswer must be individually ;anticipated and accounted for. 
The construction of asmplete rand accurate programs can be 
extxemely difficult. Jilote that even the above, simple 
exami!0.e contains two ei-rors: the possible form "den" has 
been omitted; and the; message for "der" neglects to note 
that; its use would in^iy more than a mere gender error, 
but also a mistake in case or else the use of an incorrect 
f onai! f or the masculine aaccusative ax-ticle. 
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Another point worth noting is the extreme difficulty 
of applying simple recogr^ition procedures to multi— element 
re:.ponses. When the multiple elements form a sentence in a 
natural langua<ge, the analysis is particularly demanding, 
due to considerations of both sequence and structure. Some 
valiant projects have come up with analysis techniques good 
enough to run experimental courses teaching natural- 
languages. These techniques involved the introduction of at 
most the barest rudiments of subject-matter knowledge into 
lower— level analysis; mainly the programs have relied on 
subject— independent calculations, including spelling checks 
and sensitivity to permutation of the elements making up the 
response. These techniques lack reliability in a structured 
environment : they are not certain to reach the right 
concluisions about an answer, and they may be easily misled. 
Not surprisingly, they also are thin in diagnostic 
abilities. 

One such project involved an experimental course in 
German, sponsored by IBM-Yorktown in the mid— 196G's. 
<ref. 1.1>, Their program was able to cope with multiple 
errors, though on a manual basis (each error ha<i to be 
individually anticipated). For publication, they present 
the sentence "Er kommen heute in der Stadt" ("He is coming 
to the city today"), and observe that "two unrelated errors 
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are present." Unfortunately: 

The l^irst, a particular wrong inflection of 
the verb •'kommen", was not forseen and looked 
for hy the programmer; it was detected by the 
machine as essentially a S5>elling error. 

<ref. 1.2; Adams, 1968> \ 

Their program also had a capacity for handling permutations 
in word order; but the algorithm takes no account of the 
structure of the German language, being concerned only with 
matching strings of characters and words. 

The Prench project at IMSSS,* Stanford <ref. 14,1>, 
recently provided a beautiful example of the pitfalls 
accompanying a nonstructural permutation analysis. A 
student was expected to produce the sentence "Georges est 
xm enfant" ("George is a child"), but ajrparently forgot 
that no article is required for a proper noiin, and typed 
"Un Georges est ion enfant" (literally, "A George is a 
child"). Because of the similarity of the two articles. 
The program was able to match the student's "Un Georges 
est" with the expected "Georges est un". After it 
determined that a permutation of order had taken place 
among these three words ^ the program went on to complain 
that the second "un" of the student's answer was an extra 
word! <ref. 14*2; SchoU, 1972> 

Seeking greater flexibility, some researchers have 
used a generative approach to CAI. This technique involves 

Q Institute for Mathematical Studies in the Social Sciences 
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a program capable of generatin£ exercises in seme automatic 
fashion. .These programs incorporate a rudimentary 
knowledge of the subject matter into the "generation 
algorithm, as they, must "be able to compute the correct 
answer. (Note that it is not necessary to have the ability 
to solve the problem in order to generate it.) Rarely is 
/ the automatic portion extended to provide even the simple 
aJLternative-answer recognition capability cf the 
fi:5^ed— recognition systems. 

Siklossy presents a representative generational 
program, accompanied by a call for the general 
incorporation of subject-rmatter knowledge, in his article 
"Computer Tutors That Know What They Teach." <ref. 15; 
1970> At first reading, his modest project appears to 
follow the same general lines I espouse. His program uses 
its knowledge of basic set theory (its subject-matter) to 
analyze answers for correctness, and also to dynaanically 
explain errors. Siklossy 's main concern, however, lay in 
being able to respond to questions posed by the student. 
PerhaDs for this reason, his program as described stops 
short of trying to work with mistakes which are due to 
misconceptions rather thaii merely faulty technique. The 
program dynamically figures out errors of technique, but 
does not explore alternative explanations which might 
account for the observed response. Thus, for instance, 
consider a problem in v/hich the student is asked to f*orm 
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the intersection of two sets, and produces instead their 
union* The program will, happily use its symbol 
manipulating abilities to analyze the response, and produce 
an error report exiiaustively listing all the eletnents of 
the student's answer which are members of one set but not 
the other* No test is made to see if the too-lar^e answer 
constitutes a correct set union- Similarly, if the 
student commits a common error in forming a union, listing 
some members of the intersection twice, the prograjn rejects 
his' answer i-^ith a perem.ptory "'net a set". 

I would be quite optimistic about the prospects of 
expanding his program along the lines discussed in the body 
of this report, especially to include attention to the 
possible existence of conceptual eirors and in general to 
" attempt analysis of errors for probable cause*. His case 
for the value of incorporating sub ject— matter material is 
well taken, and has resulted in a sound program on which 
one could base my kind of response analysis. 

Two large projects, by Wexler and Carbonell, come 
reasonably close to my general analysis goal, albeit in a 
totally different subject area. <refs. 21.1 & 2.1; ig70> 
These two are basically Artificial Intelligence programs. 
They, incorporate a large, complex network of da>a from the 
subject matter, and routines to manipulate it to both 
generate problems and analyze answers. The ability to 



- 21 - 

extract from the data-Tuase an alternate explanation for a 
stiident's "wrong" answer is just the kind of performance I 
am after. To form an informal example from Wexler 

<ref. 21. 2>, consider that the student is to be asked 
about the capital of Massachusetts. His response would 
have to satisfy criteria such as: 

1) must be a city 

2) must be in Massachusetts 

3) must be the state capital 

If the student responds "Springfield" , the computer 
will find: 

a) Criteria (1) and (2) are satisfied (Springfield is a 
city, in Massachusetts), but (3) fails, it is not the state 
capital. . 

The computer might also find : 

b) Criteria (1) and (3) hold while (2) fails, as a city 
called Springfield is the state capital of Illinois. 

One can see how ap£)ropriate messages can be 
constructed from the information discovered by the two 
analyses. The format of the information would facilitate 
the recording of general trends ^ should the instructional 
program so desire. For instance, (a) "student does not know 
which city in a state is its capital," or (b) "student 
tends to confuse the capitals of different states." Both 
Wexler and Carbonell have been interested in error analysis 
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primarily to enable their programs to reply to the student 

with more interesting comments, and so to help him with his 

immediate difficulty. In particular, they do not address a 

question which is of major importance in my work: how to 

deal with the ambiguities resulting from multiple 

interpretations of responses, as in (a) and (b) above. 

I should add that Carbonell does devote several 

pages to discussing error analysis, in spite, of a strong 

statement of policy which has often accompanied a profound 

disinterest in considering student errors:* 

If problems develop the ultimate objective is 
for the student to overcome them, not for the 
teacher to diagnose them. <ref, 2.2> 

Carbonell clearly recognizes the existence of different 

sorts of errors: 

In the case of symbolic answers, if a student 
asked about the capital of Argentina responds 
Brazilia, he is not making as serious a 
mistake as that made if he would answer 
Brazil (which is a country )- 

<ref. 2.3> 



He suggests that a classification ("taxonomy", <ref. 2,4>) 

of errors is needed to provide a good theoretical basis for 

diagnosis. This is presented largely as suggestions- for 

future work, albeit with a good dose of optimism: 

... if human teachers have serious 
difficulties in dealing with causes of 
errors, it may be premature to expect a 
highly sophisticated behavior of CAI programs 
in this respect. On the other hand, ISC 



*. See a similar quotation from a language teacher. Chapter 6, 
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[Information-Structure Oriented] programs, of 
which SCHOIiAR is the first example, will 
provide a rich environment for research on 
error types, detection, and diagnosis, and on 
consequent teaching strategies • 

<ref^ 2.5> 

I agree, and present iny project as just that kind of 
research 

Concerning meaningful interpretation of errors, a 
pai)er by Suppes and Momingstar <ref, 17; 1970> suggests 
research along a line similar to one I have followed. They 
postulate a cerebral automaton to account for each facet of 
the student ''s performance. In simple arithmetic, for 
example, one automaton mi^t perform the addition of single 
digits, while another would take care of the carries in 
multiple-digit addition problems • Poll owing this theory, 
incorrect perfonnance will occur in two ways: 

There may be a specific malfunction in an automaton, 
such as a failure to process carries correctly • 

2. The student may select a correctly fmctioning 
automaton which happens to be inappropriate, such as one 
which subtracts instead of adding. 

In either case, the teacher faces a well-defined 
task in helping the student with his misconception. 



* I understand that Carbonell has delved further into the 
question of error analysis in his more recent work* 
<ref • 5; private communication, 1972> 
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In its general statement, this theory corresponds 
rather well with what I am doing: we both want to find 
formulations of what has happened which can account for the 
observed deviant perfcrmance. Unfortunately, Suppes does 
not deal with the prohlem of how to go about discovering 
the existence and characteristics of these automata. Kor 
does he discuss the problem of reconciling alternative 
explanations for specific performance. 

Looking to those eiagaged in language teaching, 
particularly German, I find some encourageioent, a little 
help, and much challenge. An experienced teacher can 
easily grade a student "'c-s answer, though without being aware 
of exactly how he goes about doing so. Perhaps an 
intensive study of the way in which huioans analyze 
erroneous sentences would produce some information relevant 
to my project; I am not optimistic, at least not until the 
theoretical work on natural language has progressed much 
further. 

Human teachers do offer explanations for wrong 
answers, again at an intuitive level. In Chapter 6 I will 
enter into a detailed discussion of this behavior and its 
relation to the current project. At thrt time, I will also 
review some recent lingmstic work in the same vein as 
Suppes^ proposal. 



PART TWO 

Matching the Stixlent's Sentence with 
the Expected Words and Structujres 
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Chapter Three 
BASIC METHOD 

The grading program which I have constructed 
represents a compromiise between the wide-ranging capabilites 
of the human teacher, and the limited scope of the simple 
teaching machine. This program contains a large quantity of 
Information about German grammar, which I augment with 
information about the specific sentence I expect to be 
analyzed. The -resultant system is powerful enough to accept 
a wide range of variation in the student's response and 
still produce a meaningful diagnostic report. 

The information given to the program is not the 
exact "correct" sentence, but rather a description of it. 
This description consists primarily of the major words and 
structure of the expected sentence. The program's strateg^^-^ 
is to match the key elements of the description to what 
actually appears in the student's response. Each instance 
of a successful match provides a foothold from which the 
program can work to apply its general rules to analyze and 
•report. 

In the course of setting up the strategy for the 
grading program, I have perforce made certain assumptions 
about the nature of the problem. These assumptions also 
provide a convenient framework for discussion. 
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My Siasic iiremise if^ that the student will produce a 
specifis:^ sejritfcenc<:5*. Put differently, the ^student 's 
perf or2:sn:ce is swSiciently constrained as to warrant 
assumpticonss about iiie specific vocabulary and structure he 
will use isi Eormfn^'his sentence. Thesis particular 
assuniptiions. israr^^ right into the heart of the 

program's pgBration: when a word in the student response can 
be identified as one of those whose presence' was expected, 
the progiraui can iuinediately orient itself with the structure 
of the sentence siiuply by referring to what it knows about 
the expected structure surrounding the word in question • 
(The assumptions of vocabulary and structure take concrete 
form in the specification of the expected sentence, which is 
supplied to the program prior to the analysis.) 

The immediately following discussion will delve 
further into the operational characteristics and features of 
this identification, and consider the type of information 
which can be obtained using a structural orientation. 
In the next chapter, I will discuss the limitation imposed 
on the student by my basic premise. 

I do not assume that the student will correctly 
follow rules of inflection; on the contrary, the program 
design places as little reliance as possible on inflection 
for guiding the analysis* The matching algorithms must, and 
do, allow for variation in form due to inflectional error by 
the sissaieHt: essentially noise in thezisiatching process. 

ERLC 
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For lexical words (nouns, verbs, adjectives) the noise is 
usually not too great, and the word can Soill be recognized 
reliably. A more detailed description of the matching 
algorithms follows later in this section, I would note 
here, in passing,-,, that the matching process is somewhat 
facilitated by the restriction to specific vocabulary. 

Function words (articles, prepositions) pose a 
different problem for the recognizer. The program is able, 
of course, to determine that a particiolar word belongs to a 
given class of function words. But since these words owe 
almost their entire form to inflectional dictates, reliable 
identification must be based on more than orthographic forEi. 
Furthermore, function words (as their nsjne implies) do not 
usually convey meaning by themselves, but by their 
association with lexical words and plirases. I rely heavily 
on structural information in locating function words, makirg 
particular use of their predictably close association with 
lexical words which can be differentiated by forn and 
meaning. 7 

I have made a basic structural assumption, that the 
student will keep the clause-elements of the sentence mainly 
intact; that he will place their constituent words in a 
predictable order. A noun phrase, for example, must appear 
with the article preceeding the noun, or maybe even missing, 
but never following the noun. This assumption provides the 
power needed to cope with function words. 
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As the program proceeds throi:igh the sentence, it is 
primarily looking for lexical words, which it can recognize 
directly. By referring back to the descriptive information,, 
the program can tell what structure surrounds the word it 
has found (also as outlined previously). If this structure 
includes function words, the program need only look for them 
in the place where they should occur- Any candidate in that 
location will be accepted if it is the right kind of word. 
There may be a secondary check to see that it does not 
differ too widely. from the expected word, but that check 
will be analytic, not orthographic; i.e., based on the 
infectional characteristics of the word, not on its 
spelling. Thus, for example:* 

Expect: "das Buch .." ("The book ..,") 

Actual: "die Bucher " 

Examine: "die"; Conclude: article; 

ho identification, yet 

Examine: "Bucher"; Conclude: niatches noun "Buch"; 

Noun phrase was "das Buch", so look 
backwards for article preceeding 
the noun. 

Reexamine: "die"; Article, claim for noun phrase 

The. student may supply some extra words, in spite of 
my basic premise about close specification; or he may omit 
some words, particularly function words. The program must 



* see also example given in Chapter 5, of actual program 
trace from just this kind of situation. 

o 
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of course report on the occurrence of any such deviation 
from the expected. Internally, there is the difficulty that 
the program might push its search for a function word too 
far and claim a word that really belongs to another' clause 
element. For the present, I, solve this problem with another 
assumption, that a clause element must be a contiguous 
block.* Thus if a word belonging to some other element is 
encountered during a function word search, that search must 
fail. (A clause element can, of course, be expanded below 
by the inclusion of dependent clauses as modifiers; the 
program is able to accomodate itself to this situation. ) 

The order of elements themselves in the clause, on 
the other hand, need not meet any assumption imposed by the 
analyzer. Here the issue is even more critical than for 
inflection. I continue to assume that any rule will at some 
time be violated by a student. The German lajiguage, 
moreover, has many extremely flexible rules governing the 
placement of clause elements within the sentence. Even 
assuming that the student's performance will be correct 
would not all.ow predictions about element ordering. I have 
had t^e program explicitly avoid element-order assumptions; 
the scope of any simple structural inferences gained by 
the recognition of an expected word is limited to the 
clause element of which that word is a member. 



* This assumption may be a. bit too strong in scane instances. 
I have one example in which a. student wrote "Wieviel willst 
du Geld?" ("How much would you like money?") j thereby 
O splitting the clause element "wieviel Geld" ("how much money"). 

ERIC 



- 32 - 



Pronouns represent a slight enil:;arrassiDent in the 
scheme I have employed* Their forn: is quite sensitive to 
inflectional alteration, like function words; but pronouns 
stand alone in a clause element (sometimes with the help of 
a preposition), and so there is no structural context to aid 
the search • The program can generally recognize look that a 
word is a pronoun; the problem is naking a correct 
identification, particularly when the expected sentence 
contains more than one pronoun • I am forced to assume that 
the inflection, if not exactly what was expected, will 
nonetheless be "close." The program, accordingly, is 
equipped to decide which of two possible choices is "closer." 
That heuristic seem/i to be doing a good job. 

Word RecopTiition 

This section considers the problems of word 
recognition in greater detail, with particular attention to 
the effects of inflection. The analytic question, of how 
inflectional forms are checked for correctness, will be the 
subject of a later section. (Chapter 7) 

The recognition problem is eased hy the practice of 
supplying the program with the vocabulary from which the 
student sentence should be formed. Since only that 
vcxiabulary is considered, the scope of the search is kept 

* See Chapter ?• 
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quite small, with imediate benefits in the amount of time 
which can "be devoted to each comparison. Before doing any 
scanning of the student's sentence, the program makes up 
specialized search routines for each expected icajor word, 
taking maximum advantage ol' the inlormation it has on each 
word. In particular, a different strategy is employed for 
each word type, to be best able to accomodate inflectional 
noise. These routines do not try to take advantage of the 
limited size of the vocabulary list to make less rigorous 
com-parisons; the program will properly detect an unexpected 
word as one it cannot recognize. 

At this stage of development, the program 
incorporates a simplifying restriction on the kind of 
sentence it can analyze, a- restriction which will certainly 
need to be relaxed in the future. Each lexical word in the 
expected sentence can occur only once. , Sentences like "The 
green car is faster than the red car" cannot presently be 
handled. The benefits for the analyser are nontrivial. In 
forming a firm identification of a word in the student 
sentence with a word of the expected sentence, inflectional 
changes must be taken into account; but nothing beyond the 
specific word need be considered. In particular, the 
program need not worry about the following or preceeding 
word. 
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Misspelling will upset the program as it is 
currently constituted. This capability was omitted thru a 
decision on allocating my resources; it is less innovative, 
and not crucial for German (whose spelling is largely 
phonetic, as compared to English). See also Chapter 10 for 
some suggestions on future work on this problem. 

Inflectional changes, both correct and incorrect, 
constitute the major complication in identifying the words 
in the student sentence. Happily, many major words inflect, 
if at all, by suppletion, i.e., by adding an ending to an 
invariant stem. Such words can be located by merely 
ignoring the ending ( though saving it for later checking, of 
course). For those words which show inflectional changes in 
the stem as well, the change is usually fairly small and 
more or less predictable. The few pathologic words must 
just be handled as special cases. 

Some examples: 
Simple suppletion in noun plurals: 

das Kind, die Kinder (child, children) 

der Knabe, die Knaben (young boy) 
Stem change in noun plurals : . 

der Mann, die Maenner (man, men) 
Greater change in noun plurals: 

der Schutzmann, die Schutzleute (police) 
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I now prcxieed to a detailed description of the word 
matching algorithm for each part of speech. 

Lexical Words ; 

Ad.iectives ; Adverts send predicate adjectives are 
completely uniriflected in German. The matching algorithm 
for these words consists of a full word equality check 
against the expected text. 

Attributive adjectives inflect hy pure suppletion. 
The program does not yet handle this category, due to the 
conplications of interpreting inflectional errors. The 
match will be a stem-plus-ending comparison: a simple 
equality match of the expected text against the actual word 
(stem) allowing for an arbitrary ending on the latter. 

Nouns : Houns inflect, usually, for the plural, and 
that generally by suppletion; the stem vov^^el is uralauted at 
the same time in roughly half of the nouns. My match allows 
for an arbitrar-y ending, as the student may well not 
remember how to form the particular plural ; similarly, I 
have a flexible check for stem vowel, allowing almost any 
vowel or vowels • The program could perhaps check only for 
the correct vov/el with or without an umlaut, but does not 
make that restriction at this time. The flexibility in stem 
and'ehding is extended even when searching for singular 
nouns. The student might, for some reason, supply a plural 
instead; "also the student might not remember the correct 
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singular form of the stem, especially if he had recently 
worked extensively with the plural* 

Specifically, the match pattern consists of 
four fields: 

1, (front part of word) characters specified exactly; 

2. (stem vov/els) match any one or more vcwels; 

3* (rear part of word) characters specified exactly; 

4. (ending) match anything* 

Por example, '*der Mann" (man), plural "die Maenner" 

Pattern is: "M" <STEM VOWELS> "M" <ENDIIJG> 

Match on "Manner" with <STm VOWEI£> = "A" 

<ENDIKG> "ER" 

If a field does not exist, it is specified as blank, 
as for '^der Artz*^ ("doctor") in which nothing preceeds the 
stem vowel "a". 

Verbs : Verbs show some of the most complicated 
inflectional changes of all German words. I have allowed, 
so far, only for infinitive and present tense forms; though 
it is not the recognition problems but the error analysis 
which has delayed expansion into the other tenses. Verbs 
have endings; if irregular, they also must change their stem 
vowels and occasionally even a few consonants. As an 
additional complication, there may be one or more prefixes: 
separable prefix adverbs, the preposition "zu", (and for 
past participles, the form "ge"). As v/ith nouns, I have 
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allowed for incorrect variations on any inflectional 
pattern: any vowel substituted in the stem, arbitrary prefix 
and ending, even seme stem consonant changes* 

My experience so far has not encountered any false 
matches • For some very irregular verbs, the reliability of 
the algorithm may need some bolstering through the addition 
of stronger conditions. Per instance, the verb "essen" (to 
eat) has only a double-S invariant; the present program will 
match any word with a vowel followed by "SS"« To increase 
the reliability of the match, a plausibility check should be 
made on the prefix and ending* Some restriction on vowels 
may also be helpful; in the example, "u" or "ue" would seem 
unlikely candidates for "essen". 

A special matching algorithm will be needed for the 
very irregular auxilllary "sein" (to be), v/hose for^as show 
no resemblance at all to the infinitive. 

Specifically, the match pattern is set up with 
five fields: 

1. (prefix) matches anything zero or more letters; 

2. (tront) exact character match on front part of stem; 
3* (stem vowel) one or more vowels; 

4* (stem consonant) exact match on consonants following 

stem vowels; more than one set may be 
given for verbs that may change this part 
of stem; 

5* (ending) matched anything left over. 
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For example, "icaufer> ' ("to buy"): 

Pattern is: . 

<PREFIX> "K" <STEM VOWELS> <STE1^ CONS. "F"> <EKDING> 

Watch on "kauft" with <PREFIX> = nil 

<STEt4 VOWELS> = "AU" 
<STEM COI]S.> = "F" 
<ENDING> = "T" 

Por example, "nehmen" ("to take"), an irregular verb: 
Pattern is: 

<PREFIX> "N" <STEM VOWELS> <STm CONS. "HM","M"> <ENDIKG> 

Match on "abnimmen" with <PREFIX> = "AB" 

<STEM V01VELS> = "I" 
<STEM CONS.> = "MM" 
<EWDim> = "EH" 

• ' I- 

Function Words ; 

The recognition problem for function words is 
somewhat different from that for lexical words. The main 
routines of the program use structural information, rather 
than purely expected form, in locating the function words. 
The recognition routines are called on to determine if a 
given word is a preposition (or article), and, if so, to 
find out the word's characteristics. 

Prepositions : For prepositions, the recognition 

problem is quite simple, as there is no inflectional change. 

What the student might well do instead is to substitute a 

o . different preposition. In any event, simply checking a list 
ERJC 

^—^ Qf all prepositions suffices nicely. 
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Contraction of prepositions v/ith articles presents 
an annoying complication. I check suspected words against a 
short list of legal contractions. If one is- found , it is a 
simple matter to break it into the two constituents and 
analyze ■ them as if separate.* Errors in forming contractions 
are nearly in the realm of spelling mistakes, and are not 
yet handled; Nor are contractions of preix)sitions with 
pronouns. 

Articles: Articles are very hig}-ay inflected words 
with a plethora of orthographic forms. My recognition 
algorithms allow for any inflectional form of any kind of 
article. (The major prohlem, of interpreting what the 
student has done, rests on the inflection-checking logic) 
For a definite article to be recognized, its form must 
correspond exactly with one of the actual definite articles, 
as "d-" is not a sufficient stem for generalized checking; 
for other types, the recognition is based on stem pliAS 
arbitrary ending (just as for attributive adjectives). 

Algorithm: first search the list of definite 
articles for an exact match; if so, TYPE is ^'definite", 
ENDING is taken from the list... "die - e", "das - es", "der 
- er", "den - en", "dem - em". If the word is not a 
definite article, then try for a stem-plus-arbitrary-ending 

* this check is actually made on every word, before any 
other checks are made on it. The inefficiency of the extra 
search are made up for by avoiding problems of crossed 
O pointers when splitting a contraction. 
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match usin£; all the other article stews; set TYPE to 
"Eli^WORD", "DIES-WORD'S or "POSSESSIVE" accordlnely. 

Pronouns : ^ . 

Pronouns piesent a major problem to the program as a 
whole, but not too much headache for the recognizer, which 
is only asked v/hether a particular word is a pronoun. 
Gernan pronouns inflect mostly by drastic changes in form, 
so a simple check against a list of all personal pronouns is 
sufficient to make the roLi£.h identif ication. A slight 
complication occurs in definite pronouns, whose forms are 
mostly the same as for the definite articles. Again, the 
question of is it or isn'^t it can be resolved by checking 
against a list of all legal possibilities; the stickier 
question, as to whether the word is a jjronoun or really just 
an article, is taken care of by the pronoun search logic. 
(See Chapters 5 and 7 for more details.) 
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Chapter Four 
IMPLICATIONS OF THE LIMITING ASSUMPl'IONS 

In the previous chapter, I set forth the various 
assumptions under which my program operates. These 
assumptions are acceptable as definir^ and delimiting a 
specific project; however, the relation of this project to 
the outside world requires some further discussion. In 
particular, I should like to discuss the definition of the 
boundary created by the operating assumptions, and consider 
the justification cf these assumptions from an educational 
point of view. 

Boundary ; 

The program at this time adheres rather closely to 
the assumption that the student is asfced for a specific 
sentence and will endeavor to produce that sentence. The 
program conducts its search and analysis under close 
guidance from the expected answer. Although it is 
sensitive to many variations, the progE'am does not attempt 
any kind of general analysis which woifLd encompass an 
arbitrary German sentence. In paarticiilar, there is little 
attention paid to determining the meaning of the student's 
sentence as possibly different, from "the expectation. There 
is a basic problem at issue, which has to be solved by the 

o 
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ba;sis2: i#rf 1 osophy of the £xadin£ project: how distin£ui.sh 

dxsifcCE±:T(?Tri [granmiatical error] froni intentionsl semantic 

chacge.*. I believe i:hat soir.e headway can be made in making 

such a^: distinction, but for the present project have made 

the program concentrate only on error .analysis; in fact, I 

have designed it to operate in spite of possible alternate 

meanings and grammatical ambiguities which might otherwise 

have frustrated the analysis • 

Currently, any deviations from the expected 

' formulation, either in individual words or in structure, 

are analysed as local errors only. Certain manglings -can, 

of course, result in a sentence with a different meaning, 

possibly expressed in "correct German'* if taken alone. For 

example , 

-sentence- -program will report — 

Die Frau sieht ihn* Correct 
("The woman ^e.s him.'^) 

2. DielSiSU sieht l&mi. Pronoun wromg, perh^s 

dalzive insSread of ^cusative • 

3* Die Srau sieht ar. PronoLin wrong, perhaps 

nominative instead 

Sentence£^3 tepP^J^-S to l:>e also a correct way of e:mressing 
the thoiagtilt '^?iie sees tte womarA," rsince '^dae Prau" can be 
either ii^#i§ZLative (subject) or accusative (object). The 
program wfS not be distracted by this different meaning, 

^ for conciseness, the program output is paraphrased. 
Actual examples of the diagnostic comments can be seen 
elsewhere* 
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It need not remain totally isncrance of the alternative, 
either. In future work, I would like to explore 
incorporating certain special checks depending on specific 
errors. Various heuristic approaches will be needed; the 
above example could have been caxaght by checking possible 
inversion whenever an object appeared in nominative case. 

Limitation; theoretic 

The assumption of a specific sentence, aj?,d the 
information provided to the program, provide a great deal 
of assistence to the analyser, but at the same time greatly 
restrict its generality. I have tended to justify the 
restriction as necessary to be able to cope effectively 
with a natural language situation, and indeed I am willing 
to make sacrifices in order to obtain powerful grading 
facilities. I believe there is a basic theoretical 
question here, as to whether extra information: is really 
needed to recognize improperly formed sentences. A few 
comments may serve to illustrate the problem, and, with It, 
my choice of assumptions, 

(An informal inf ormation-theQJEetic arg,a!ii:ent) 
As allowance ds made fer the possibility that the 
input will not conform to the rules of German grammar, 
supplementary information is needed to compensate for the 
assumptions we are no longer entitled to make. If 
inflections or word-order are possibly wrong, some other 

o 

ERIC 



- 44 " 



source must be found for the information th^/t they provide 
about the meanirixg of the sentence • Turned around, the 
argument is that by having other sources of information, 
the grader can regard these rules as items to be checked 
for correctness rather than as crucial aids in completing 
the analysis. 

A human teacher knows what he asteed, and thus has 
extra, ^sentence— specific information available when he is 
grading the student response. 

Limitation: education ^ 

It rem^SBS to examine the restrictions from an 
educational point of view.. Is there any place for a 
grading prograjn with such restricted scope that it can only 
handle one specific sentence at a time? furthermore, wh^t 
do language teachers feel about such restrictions? Both 
questions can be^ answered favorably: even though the 
project has beejs kept carefully circumscribed, it still has 
possible uses. 

One conBHon technictue used by aimodern approach to 

language teachiing is to have the student "overlearn** basic 

sentences. These basic sentences, or parts of them, are 

used as "building blocks*' as the student goes on to new 

material, producing a situation in which vocabulary will 

be naturally quite restricted. From one such text: 

This lesson is devoted to review ... of 
85 sentences, of which some are identical 
with the basic sentences of the preceding 
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lessons, some are new combinations of the 
elements of the basic sentences, and some 
contain material to increase your reading 
vocabulary. These sentences are in German in 
the left-hand column, with equivalent English 
sentiences in the right-hand columji. Study 
these by first reading the German sentences, 
referring to the English sentences only when 
you are" uncertain. Repeat this study until 
you can recognize and understand accurately 
every one of the German sentences. 

The next step is to look at the 
English equivalent sentences and check your 
ability to produce the proper German 
sentience. 

<ref . 12. 1r, Rehder & Twadell, 1958> 



In another text, tine directions to the student for 
a ^particular exercise leave very little room for inventive 
work: 

Express the following sentences in German. 
This is not meant to be a translatidn 
exercise; the English sentences should 
"trigger" their German equivalent s.« 

<ref. 10.2; Lohnes & Strothmann, 1968> 



A third example seems almost tailored exactly to 

■f-rtr the capabilities of my program. (The citation is from 

the Introduction, a general description some of the 

material offered by the text, a beginning reader): 

synthetic exercises, made up of 
"dehydrated" sentences . . . The student is 
given the thought content and vocabulai*y of a 
p»articular sentence and is asked to 
reconstruct the sentence by "adding grammar." 

<ref. 19; 1967> 



Philosophically, language teachers recognize that 
freedom of expression has its place. That place may 
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not be everywhere: 

In most ca^^es, putting the meaning across is 
a minor x)art of teaching a dialog* 

<3ref. 8.1; Lado, 1964> 



We conclude that il: is not the tocher's task 
in first— j^ear Geinnnnn to chat discuirsi^l^ 
with his students, tottt to guide tfceri S21 the 
controlled formatioir of habits. So fer as 
the students are CGfieemed, the mat«ri^ they 
are practicing: is itiie German language -:fer the 
time being* ,We need -and probably sKoirBd: not 
go beyond this materiiial; "other ways oSI 
saying the same tiling" are a distracticai, not 
an enrichment, for :th:e beginning stud&mt. 

<ref. 12U2; Rehder & Twed^^ ir:956> 



I migiit addi thai :IE do not nec^sssrife^ agree with 
all the opinions whicii 1; jiaave cited, and woiild certainly be 
^PPy to have greater flexibility available, jff^t the same 
time, I do not lose sight of some rather signif icant^'i^ 
possible benefits possible even with the iiioiiiiations 
intact. By being able to vaccept f llLI seji*emD:es and 
monitor, automatically, aLj.arge number of* rffinfferent rules, 
the grading prograjn immediately permits a imDre flexible 
ciirriculum than would be ipossible with a Jiess versatile 
grading facility • I feel that I have already come far 
enough that research into using the grader in new flexible 
curricula can begin in earnest. 



EKLC 



- 47 - 



Chapter Pive 
PROGRAM ARCHITECTURE 

Uais section describes the basic architeeiy;^ of 
the program, and includes a fairly detailed di^e3^i*3£i of 
the algorithm involved. 

The program is given two major inputscsfa. 
description of the expected sentence, preparediJ&y' ^fciie 
teacher- or helper; and the actual student seniaaas; ssdach is 
to be graded. The analysis takes place in f ourip^ases:: 

1. pass through the description of the expeeil^i; jiiQiitence, 

to build a guiding structure; 

2. main pass through the student sen tence , re^^pizing 

key words and collecting associated woros: Sw; 
directed search; 

3. second scan of the sentence to collect prcffiiMiEs: and 

separable prefixes; 

4. word-order analysis of the clause-element sSsruciiure 

to which the sentence has been reduced. 

Here is an exsjnple of the program's actual 
operation which I will use at several points of this 
discussion. The student is asked for "Er kauft ea:nen:iHut" 
("He buys a hat"), and responds correctly. The following 
output represents just the grading comments; later sai: the 

o 
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discussion, I shall present the sane analysis with the 
program printing internal tracing commentary as well. 



RUN'I. 

Description generated by human expert: 

(STMT (SJ ERJ (VERB KAUFEIJ) (OJ ACC EINE?J HUT)) 

SEIJTENCE INPUT BY STUDENT .. 
Ell-IEM HUT KAUFi' ER 

NOW PROCEEDING WITH MAI.YSIS 



SUMMARY RElWi' FOR NOUN PKxASE INFLECTION 

ON Eli^EN HUT 
Best we can do guessing intent is: (ACC S K) 
CONFIDENCE LEVEL: VERYHIGH 

» GEI\'DER REPORT FOR NOUN: HU^' :: 

STRONG CORRECT = K 

. * . CASE REPORT FOR OBJECT : EIKFi^f HUT : : 

STRONG CORRECT = ACC 

CASE REPORT FCl SUBJECT': ER :: 

STRONG CORRECT = NOM 

VERB mWim CHECK ..OK! 

Phrase division done, sentence type report: 

Final verdict is FV-2 , level of confidence: 
VEFamOH 

Play by Play account of analysis: 

Due to: INITIAL HYPOTHESIS , FVT set to FV-2 

Confidence in FVT setting is: VERYHIGH 

Summary Comments 
Clause type correct (FV-2) 

: EtiL OF Rm 



Operation of Phase One ; 

The program begins with a pass through the 
description in order to build the complex structure it 
needs for attacking the student sentence, 
o ■ 
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For each clause of the expected sentence, a list of 
the important words* is collected into a list called 
WDLIST, At the same time, the pro£^ain builds a structure 
for each clause element, containing the consitutent words 
and information which will be needed in the analysis. The 
structural details communicated in the sentence description 
play an important role in this stage of building. 
Cross-reference pointers are set up between the word 
entries in WDLIST and the corresponding clause element 
structures, and the latter are collected together into a 
SDLIST. Pronouns, because of their special problems, are 
collected not in WELIST but in another place of their own, 
PNLIST. 

At the end of Phase One, the sentence description 
consists of the accumulated WDLIST, SDLIST and PNLIST's for 
each clause. Specific information on the particular words, 
needed in various stages of the analysis, is available to 
the program by this time ; it has either been given as part 
of the description, saved from previous work, or requested 
from the experimenter during Phaise One. 

The other main input to the program, the t'tudent 
sentence, is stin but a simple collection of words. They 
axe arranged in such a way as to facilitate the analysis. 

To continue the previous example, 
"Er kauft einen Hut." ("He buys a hat.") 



* See Chapter 3 for definition of this concept 
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The initial description, as prepared by a teacher and 

Civen to the program, contains both the text and structure 

of the expectj3d sentence: 

(STMT (VEfiB KAUFEN) 
(SUBJECT 

(PRONOUN ER)) 
(OBJfXT ACC 
(KP 

(ARTICLE EIKEN) 
(NOUN HUT) ))) 

(The abbreviated descriptors "SJ" & "OJ" which appear in 
the full examples actually expand into the more explicit 
structure jOiown here. I have not placed much emphasis on 
raaking the procram smarter about the input description, 
concentrating my efforts instead on the analysis of the 
student sentence given whatever seemed the most helpful 
description, ) 

Here is a complete listing of the WDLIST as set up 
for the analysis of "Er kauft einen Hut." It contains two 
search functions, NOUN-WD for the noun "Hut" and VERB~V/D to 
look for a form of the verb "kaufen". The other 
inforDiation represents the parameters needed for the 
identification, particularly the patterns for guiding the 
match algorithm. ;Che cross-reference pointers to the 
SDLIST terms are held in WS-7 (for the noun) and WD-6 (for 
the verb) . 

((NGUN-WD WI>-11 HUT ((H) ((SV . W) T (END))) (WS-7)) 
(VERB-WD (QUOTE WD-5) ' - 

(QUOTE KAUFEK) 

(QUOTE ((PRE) K(SV . V) (SC (P)) (EI©))) 
(QUOTE WD-6))} 
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Here are the SDLIST terms, a^ain as functions with a large 
numher of parameters. The COMELTA-SD functions handle 

either subject or object the sixth argument tells 

which. The last argument is itself a structure of 
functions, describing the constituent words or phrases* 
The first argument of each of the top-level SDLIST 
functions is the name of the variable holding the 
cross-reference pointer from the key WDLIST or Pi^ILIST 
function. (My comments in []'s). 

rWS-2 points here:] 
(COMELTA~SD (QUOTE WS-2) 
QUOTE PP-3) 
QUOTE NOM) 

QUOTE (SUBJECT SPEC)) 
QUOTE NIL) 
QUOTE SUBJECT) 
QUOTE ( 

(PROWOUN-SD WJ>-4 ER (3 NOM S M PER))))) 
[WS-7 points here:] 
(COMELTA-SD (QUOTE WS-7) 
QUOTE PP-8) 
QUOTE ACC) 

QUOTE (OBJECT SPEC)) 
QUOTES) 
QUOTE OBJECT) 
QUOTE ( 

(i^JP-SD SD-9 

(ARTICLE-SD WI>-10 EINEW) 
(UOUN-SD WD-11 HUT ((CSSV . U) 

(XGENDER . K) (HNOUJMPLAG . T) 
(XNCLASS .0))))))) 

[Wr^6 points here:] 
(VERB-SD (QUOTE Wr^5) 

QUOTE KAUFEN) 
'quote VERBI) 

' quote CL&VERB1 ) , , ^ ■^ x ^ 

QUOl'E ((SS {F.F SC) (AU AU SV)) (CLASS . 1))))) 
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The PNLIST looks just like the WDLIST, except that it has 
only pronoun terms at the start. The value of WS-2 is 
a pointer to the SDLIST term. 

apRONOUN-WD WI>-4 ER WIL (WS-2) (3 Iv'OM S M PER))) 
(PP-3 PP-8) 
FV-2 ) ) 

Initial form of student input: 

((WORD ICH) 
■ (WORD KAUPE) 
(WORD EINEW) 
(WORD MMTEL)) 

(Actually, the "exploded" form of each word is also 
included, to boost operating efficiency; thus, 
(WORD ICH. (I C H)). ) 

Each element on the WDLIST is concerned with one of 
the major words of the expected sentence. The element 
specifies two thin£;s, a search function and a 
cross-reference pointer. During the second phase of the 
program, the search routine will be given a sequence of 
characters: a word from the. student's sentence. The 
routine must decide if that sequence matches its target 
word, i.e., the particular major word from the expected 
sentence with which it is associated. This identification 
task is entrusted to a matching algorithm specially 
tailored to the particular word- type in question. (see 
Chapter 3, above, for a description of these algorithms). 
[The name of the particular fumction is usually that of the 
word- type, suffixed with an identifying "-WD", as 

o 
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"NOUW-WD".] The necessary information specifying the 
specific word to be found is all assembled by the Phase One 
processing. 

The cross-reference pointer will be discussed more 
fully below. 



Phage Two ; 

After the first phase has finished. Phase Two 
begins by activating the clause executive function with the 
WDUST,. SDLIST AND PNLIST for the main clause. The basic 
scan of the sentence is undertaken at this time, proceeding 
from left to right. Pursuant to the strategy discussed 
above, the top level activity is concerned with finding 
major words. The major word recognizers are collected on 
the WDLIST; the job of presenting the next word of the 
sentence and deciding what to do with it falls to a 
function named RUNWD. 
RUNWD: 

1. No words left: set PINISHED flag for clause 

exec ut i ve ; exi t . 

2. See if word is a function word (article, 

preposition). If so, remember it, 
go on to next word. 

3. For each element of the WDLIST, activate the 

match function. If one liiies the current 
word: 

3a. remove that entry from WDLIST; 
3b. set up pointer into corresponding SDLIST 
entry (variable is "WD-SD PTR") ; 

exit. 

4. Test if word is a pronoun: note,, go on 

to next word. 
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(step 5 is relevant only for relative clauses ••.) 

5* No match yet,. •look througli all other WDLIST's, 
as in step 3. if one matches, then: 

5a. set NEWCLAUSE flag for clause executive, 
saving the name of the clause 
, containing the lucky WdLIST element; 
5b. set FINISHED fl^ig if moving up a level; 
■ 5c. do steps 3a and jjb above; 
exit. 

6. If none of the above, complain about extra word. 

The r cognition functions cn WDLIST do very little 
analysis of the words they find. Their importance lies in 
their links to the analysis structures on the SDLIST. 
. ■ Each word (on V/DLIST) has a pointer to the structure for 

the clause element of which it is a part. This pointer 
leads to the head of the clause element of which the word 
is a part, not to the SDLIST function for the word itself. 
In the case of a clause element with more than one m.ajor 
word, all corresponding WDLIST entries would have the same 
pointer. A clause elements SDLIST entry reflects the 
structure of the element; entering it from the top gives 
the program full analytic control of the context while it 
is processing any part of the structure. 

Here is the previous example, this time including 
output relevant to the internal workings of the program. 
Some of the comments refer to features not yet discussed. 
For the present, I would call attention to lines beginning 
with one of the following legends: 

"DOWD just did " — identifies a WDLIST function which 
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successfully matched a word of the student sentence 
"SDDO: " — beginning execution of an SDLIST structure 

which was activated by a cross-reference pointer; 
"TKDO: »« — beginning execution of individual functions 

within the SDLIST entry currently active. 



■ RUN 1 

New tasks: ((V-i'VENDCK)) 
DOWD jtist did NOUN-WD 

SDDO: COMELTA-SD (WS-7) 
THDO: NP-SD 
THDO: ARTICLE-SD 
THDO: NOUN-SD 
THDO: ARTICIiE-SD 

New tasks: ((NICK TEM-'i2 TEM-13)) 

SUMMARY REPORT FOR NOUN PHRASE INFLECTION 

ON EINEN HUT 
Best we can do guessing intent is: (ACC S M) 
CONFIDENCE LEVEL: VERYHIGH 

GENDER REPORT FOR NOUN: HUT :: 

STRONG CORRECT = M 
Task just done: NICK 

CASE REK)RT FOR OBJECT: EINEN HUT :: 

STRONG CORRECT = ACC 
DOWD just did VERB-WD 

SDDO: VERB-SD (WD-6) 
STEMTYPE: * 

Stem-ending char ((2 P 7) (3 S 7)) 
New tasks: ((VERB1)) 
Task just dones VERB1 
THDO: PRONOUN-WD 

End of search for pronoun ER level = 4 GOOD 
THDO: PRONOUN-SD 

CASE REPORT FOR SUBJECT : ER : : 
• STRONG CORRECT = NOM 
New tasks: ((POSCK-SUBJ)) 
Task just done: POSCK-SUBJ 
JSCHR = * 7) 
JECHR = (3 S 37) 
JSECHR = (3 S 7) 
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VERB ENDING CHECK OK! 

Task just done: V-FVENDCK 

• Phrase division done, sentence type report: 

Pinal verdict is FV-2 , level of confidence: VERYHIGH 

Play by Play account of analysis: * 

Due to: IWITIAX HYPOTHESIS , FV1 set to FV-2 

Confidence in FVT setting is: VERYHIGH 

Summary Comments 

Clause type correct (PV-2) 

'■ END OF RUN 

To follow through an SD.LIST function in operation, 
consider what happens when the noun "Hut" is found. The 
proeraip trace at that point contains the message "DOWD just 
did NOUN-WD". That function returned a cross-reference 
pointer, namely WD-7, which the prograrr; he^^ins to follow 
("SDDO: COMELTA-SD (WS-7)" message). The full structure of 
that element is given ahove; here is an abbreviated form 
for quick reference: 



COMELTA-SD OBJECT' ACC 
NP-SD 

ARTICLE EINEN 
NOUN HUT 



Following the execution, with actual program trace shown as 
comiflents in square brackets: 
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[SDDO: COMELTA-SD (WS-7)] 

COMELT-SD is a function, which, when entered, sets up 
the program and grammar context for the analysis of 
the clause element... the name, "object", expected 
case, "acc", and other variables to hold the results. 

[THDO: NP-SD] 

5JP-SD is entered, and like COMELT-SD, sets up 
a program and grammar context for the analysis 
of a novxi phrase. It then defers further action 
while the phrase constituents are handled, 

[THDO: ARTICL&-SD] 

ARTICLE-SD is entered. This SDLIST structure was 
activated because the noun "Hut" was found. Until 
the noun's analysis is accomplished, this routine 
graciously defers. 

[THDO: NOUIJ-SD] 

NOUN-SD is entered, and completes the analysis of 
the noun "Hut", detercining its inflectional 
characteristics for fiirther checking later. 
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[THDO: ARTICLE-SD] 

ARTICLE-SD is tried a^in, is now happy 
to do its job- It scans back to pick up 
the article "einen" immediately before the 
noun, analyzes its inflection, and returns. 
[Hew tasks: ((WICK TE3''.-12 TE14-13))] 

NP-SD now resumes, checking the inflect" nal 
status of the entire phrase using information 
returned by ARTICLB-SD and HOUH-SD. 

' SUmARY REFORT FOR NOmi PHRASE IWPLECTIOJV 1 

; ON EINEN HUT 

[ Best v/e can do guessine intent is: (ACC S E) 

CONPIDETiCE LEVEL: VERYHIGH ] 

' » GETffiER REPC'HT FOR NOUN: HUl' :: 

STRONG CORRECT = M 
[Task just done: NICK 

With NP-SD finished, COMELT-SD can do v/hat analysis 

it has for the whole element, before it, too, returns. 

[ CASE REPORT POR OBJECT: EINEN PlUT :: 1 

[ STRONG CORRECT = ACC 1 
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Data structure ; 

The results of the analysis are saved in two ways. 
Specific grading observations, such as would be of most 
interest to a teacher, are expressed as (English) messages 
typed immediately at the terminal. All during the 
analysis, the program internally adds the information it 
finds to a data structure representing the student's 
sentence. This structure is represented by a list, which 
at the beginning consists of only the individual words 
input by the student, as given above. The search functions 
(from WDLIST) will chajige the particiilar word entry to 
reflect what they have discovered and decided about the 
word. Thus the simple 
(WORD (KAUFEN (K A U P T))) 
becomes s 

(VERB (WORD KAUFT (K A U F T)) 
((SO F) (SV A U) 
END T) 
CLASS .1) 

END . T) ^ ■ 

ENDCHR (3 S 37) (2 P 77)) 
SECHR (2 P 7) (3 S.7)) 
SS (* F F F SC) (* AU AU AU SV)) 
;STEJ4CHR (* * 7. ' 
TEXT . KAUFEK) 



The various analytic functions thus have uniform 
access to each other's findings, independent of specific 
local variables. in the program. 

An especially significant change in the data, 
structure is its alteration to reflect the structure of 
German, in grouping of words into phrases and into clause 
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elements. Thus the function UP-SD, which handles noun 
phrase analysis, will tDke a constituent article and noun 
which were originally two WORD entries at the top level of 
the structure. They will be combined under a new, sin£le NP 
entry, which also holds information perrtainin^ to the 
phrase as a whole. A similar structural change is done by 
COl'IELTA-SD to group the ©nfet;fe Jaecusative object together 
into a single element at the top level of the clause. Here 
is the final data structure from the analysis of the 
current example ^ v;ith the structure apparent from 
indentation as well as parenthetization : 



( 

(PV-2 
((OBJECT 
(NP 

. ((ARTICLE (WORD ElUEN (E I W E N)) 
(CACLASS . 1) (PMB . EIO 

(ATEXT . EIWEH 



(STEMCHR (* * 7)) 
(text . ICAUPEK))) 
( SUBJECT (PRONOUN ( WORD ER ( E R) ) 

((CHAR 3 NOM S M PER) (LEVEL . 4) 




((PPCREF . PP-3) (CASE . NOM) 

(CASEWHi SUBJECT SPEC) 
(PERSON .3) 
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Piiase Two Summary ; The clause executive ci(®es Phase 
Two of the Eatch/analysis lasing the functions fiesciribed 
above. RUNWD is called ^ach time for a new wetf^ amd then 
a V/D-SKPTR is followed to analyze the aseociated clause 
sajBrnmi-* aoor kee^ps up . until RUNWD signals EIWISHED, 

generally indicating that the end of the student's sentence 
has been reached. (In sentences containing rels^ive 
clauses, FINISHED will ;sevb at the end of &^sch (^i'ause.) 
M -tails tsmm^ ^31 the iiKilST -entries should be .^fefefied, 
and gone; if not, there are v/ords missing from the clause. 
At this point, appropriate action could be coj^Mared. If a 
noun is missing, the jpri^m may fbr iii^tance ita^st to check 
tor th^ possibility of paraphrase by an equivalent pronoun. 
If so, by simply following the cross-reference pointer of 
the missing notiB to its SDLIST entry, the program can 
obtain the necessary case and number information to specify 
the equivalent pronoun; it then generates a PNLIST entry to 
actually look for the pronoun. 

Phase Three ; 

Next comes Phase Three, at which the PNIOiST is run. 
Each term of the list describes one pronoun. The pronoun 
search function is prepared to look anywhere in the clause 
for the pronoun. ;rhe major difficulty lies in establishing 
identity when the clause contains more than one pronoun. 
Each PWLIST entry has a pointer to its corresponding SDLIST 
element, 3.ust as for the major words on WDLIST; once the 
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pronoun's identity ha^ been firmly established, the link 
can be followed iranediately , as there is no question of 
possible clause changes* 

The technique for finding pronouns is conceptually 
simple. Each pronoun is represented by a functional entry 
on the PNLIST.. When activated, this function searches all 
th^e free words of the clause. Any words which ai*e pronouns, 
are compared with the expected value, using a measure of 
goodness of fit* An exact match can be claimed 
iffim.ed lately . If none of the candidates matches perfectly, 
the program will choose the "closest" one.* 

A tie may occur, if tv/o pronouns lie at the same 
distance from the one being sought* A tie can exist only 
if the sentence contains at least two lonclairaed pronouns, 
and the ore currently being sought has not been correctly 
formed* There is a good chance that the other pronoun will 
be correct; or, failing that, at least not wrong in such a 
way as to duplicate the current confusion. The strategy, 
then, calls for the particular search function which has 
encountered a tie to calmly give way, allowing another 
PNIilST element to be activated and hopefi>LLly claim one of 
the embarrassijig alterinatives. The delaying mechanism can 
be invoked at any stage of the function's operation; and it 
can be used repeatedly if necessary. (See scheduling 
discussion, later this section). 



^ See Chapter 7 for a description of the distance metric. 

EKLC 



In a pathological case, it may occur that neither 
PNLIST function is happy with what it sees, and so each is 
waiting for the other. When this occiirs, a modified 
distance-measuring function is tried in an attempt to break 
the tie. If that fails, as for instance must happen if the 
two words are identical, the function arbitrarily picks the 
candidate situated nearer the end of the clause. 

Here is some program output from a simple sentence 
containing a potential ambiguity in pronoun recognition. 
The student is supposed to produce "Sie sieht ihn" ("She 
sees him" or "i t "/masculine ) , but instead uses a neuter 
pronoun: "Sie sieht es". Ily comments are in [j's. 

— RUN 2 (abbreviated) 

THDO: PRONOUN-WD 

End of search for pronoun IM level = 3 FAIR 

[Lid not find "ihn", and no way to choose between 
"sie" and "es", so delaying] 

TKDO: PRONOUN-WD 

End of search for pronoun SIE level = 4 GOOD 

[Search for "sie" finds an ex&ct match, so go 
on to SLLIST work:] 

THIX): PRONOUN-SD 

CASE REPORT FOR SUBJECT: SIE :: CORRECT = KOM 

VERB ENDING CHECK ..OK! 

THDO: PRONOUN-WD 

End of search for pronoun IHN level = 3 FAIR 

[Second try for "ihn". Match results are no better 
But since "sie" has been claimed already, there is 
no ambiguity and "es" can be claimed. ] 

THDO: PRONOUN-SD 

PN SUBSTITUTION.. Should be (3 ACC S M PER) 

is (WORD ES (E S) 3 ACC S N PER) 
CASE REPORT FOR OBJECT: ES :: CORRECT = ACC 

Summary Comments 

Clause type correct (FV-2) 

END OF RUN 
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Many pronouns are associated with more than one 
grammatical slot (see Chapter 7 for a more complete 
discussion of this notion). For instance, "es" serves as 
both nominative and accusative in 3rd person neuter; "sie" 
serves both plural and singular as well as nominative and 
•raccusative. In these cases, the program considers each 
possible slot as if it were a separate word, so that its 
final determination specifies the best choice. Of course, 
if a tie results only from two slots associated with a 
single word, there is no point in delaying, and the program 
will proceed directly to a forced choice. 

Special consideration is afforded subject pronouns. 
A brief check on the finite verb ending tells whether the 
verb agrees with the expected value. (The full verb ending 
check procedure considers mainly agreement between the verb 
and the actual subject). If the verb is not what vas 
expected, there is a good chance that the subject pronoun 
may also be different. In this case, a limit placed on 
the closeness measure blocks the determination of an exact 
match, thus forcing consideration of other possibilities. 
Thus if the verb is singular instead of an expected plural, 
the program must be prevented from locking onto a 
"sie"- plural form without also considering its singular 
representation (and any other nominative pronoun forms that 
may be present). 
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In the event of a tie leading to a deadlock, 
a^eement with the actual verb ending counts positively for © 
a subject pronoun candidate. 

Here is an example in which the program is looking 
for "Er Icf.xuft es" ("He touys it") and finds, instead, "£ie 
kaufen es" ("They are buying it"). My comments in []'s. 

RUN 5 ~ (abbreviated) 

THDO: PRONOUMD 

End of search for pronoun ES level = 4 GOOD 

[success for the object pronoun] 
THDO: ERONOUN-SD 

CASE REPORT FOR OBJECT: ES : : CORRECT = ACC 

THDO: PRONOUN-WD . 
Subject pronoun: expanding. search limits since 
verb ending does not match expected subject 
End of search for pronoun ER level = 3 FAIR 
More than one possibility in search for pronoun ER 

[Only one word, ^'sie", but several possible forms] 
Check with verb ending — 1 choice for sub j pronoun 
THDO: PRONOUN-SD 

. , m SUBSTITUTION.. Should be (3 NOM S M PER) 

is ((WORD SIE (S I E)) : (3 NOM P * PER)) 

CASE REPORT POR SUBJECT: SIE :: COIiRECT = NOM 

VERB ENDING CHECK OK! 

Summary Ccnmients 

Clause type correct (PV-2) 

— END or RUN • 



Phaf^ft Th ree Summary : At the end of Phase Three, 
all of the SDLIST entries should have been processed. 
The previous checks, on WDLIST and PNLIST, ensure that any 
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entry which is located via a pointer from either of those 
two will have "been doner' The additional check at this time 
allows eiemerits which are purely analytic, without any 
match entry. ' "' 

Phase Pour : , _ 

Phase Pour of the analysis consists of checking the 
word order of the noi' totally identified clause. The 
function PHRASEDI VIDER carves out verb/clause type and 
front field , inner field , and end field . PHRASETYPEREPORT 
comments on tliat information and its relation to the 
expectations as present in the description. The comments 
produced by both functions are clearly visible in the 
program output. "Phrase division done ..." is 
PHRASEDI VIDER, "Summary comments ... "introduces the 
PHRASETYPEREPORT comments.. 

Pinally, the entire clause is depressed in the data 
structure holding the student sentence, so the clause 
appears as a single element at higher levels (as described 
above in reference to phrases and clause elements, and 
visible in the example given there) . 

A bit of careful bookkeeping makes two ti'icky 
situations come outfight: phrases with more than one major 
word, and* dependent tsubprdinate) clauses. Within the 
SDLIST processing, the main scan may have to advance past 
the major word which provided the initial entry to the 



SDLIST: for instance, in a phrase containing two nouns, like 
"eine Tasse Kaffee" ("a cup of coffee"). In the example, 
the identification of "Tasse" v/ould take the analysis into 
the nouri pnrasS"; onC'e inside^ the analytic entry it)r the " 
second noun "Kaffee" would be encountered, although it has 
not yet been seen* :RtJrJV/D is called to inake the njove and 
look at the next word of the sentence. Note, in the above 
discussion, that it only sets a cross— reference V/I>-SD PTR 
when E word is found, it does not imined lately follow that 
pointer* In the present case, the procram notes that the 
new pointer leads 1o the element currently being processed, 
which is fine. If a word should be missing, the RUIA^'D scan 
would pick up a major v/ord from another clause element. 
This change will cause termination of forwar^d scanning for 
the ciirrent clause element, thus meeting an important 
criteria by not accidentally claiming words belonging to 
other clause elements. The cross— reference pointer from 
the new word is saved, of course, and will be followed 
after the current analysis is finished. 

Dependent clauses of all types are handled by a 
fiorther refinement of the same techniques 3ust described. 
If the sentence contains a dependent clause, it will -be 
linked into the main clause thj'ough either a WLLIST or 
SDLIST entry , depending on whether it functions as an 
entire clause element of the main clause or- a modif ier 
within some element. Separate WDLIST, SDLIST and PNLIST's 
aro created for constituent members of the dependent clause* 



I-f during its search, R'jNU'L is unable to identify a 
Word as a eitlier a function word, pronoun or major word 
froc! the current clause, it proce-eds to check the other 
*'-V/©LZCT"=^ r- A stfticest^ul ma Lch indicates'" that a*°chan£:e T)f 
clause is needed; the flag WEWCLAUSE is set as indication. 
The level of each clause, recorded during Phase One, is 
checked at this time. If the clause in question is not 
subordinate to the current one, the end of the current 
clause has been reached (according to the assumption that 
clauses will be contiguous unit^) ; the FlNiSHRD flag will 
be set to trigger the necessary processing. 

The existence of more than one clause in the 
sentence means that the formal context of the clause itself 
must be maintained. The function CLAUSER coordinates 
Phases Two, Three, and Pour of the analysis, as well as 
containing the necessary contextual variables; CLAUSER is 
simply called recursively for processing subordinate 
clauses. The separation of WDLIST and SDLIST activity aids 
this recursion. if the mViCLAVSE flag is set, the 
appropriate changes in clause context will take place 
before the program follows the WJD-SDPTJR. Thus, even for 
verbs, which are single-word elements, the analysis work is 
done by an SD-function. Per a clause subordinate to the 
current one, the context change taites place when the 
WD-SDPTR is followed; the pointer will lead to a function 
(clause-SB) which as part of its action maizes a recursive 
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call on the clause executive, with new V/DLIST and SDLIST 
set up for that clause* Thi^ change in clause raay h.^ppen 
at any level of the program* 

Dependent claiises are introduced by jme^x)f £Leveral 

types of subordinating words ♦ These words present a bit of 
a problem to the analyzer* A slight bit of legerdemain 
makes question words and subordinating words come out 
right- Grammatically, these definitely indicate the 
beginning of a new clause. To keep the program on the 
track, cognizance should be taken of the clause change^ 
These words do not have enough lexical content, however, to 
permit unique identification of which clause has appeared 
if the sentence contains more than one dependent clause- 
Further compounding the problemx, German has a great deal of 
overlap in the forms taken by question words, subordinating 
words, demc IS trative pronouns, and definite articles. The 
current implementation for subordinating words uses a 
WDLIST entry which detects a question word or subordinating 
word, but the activation of their WD-SEPTE is delayed, as a 
special case, until some other major word has effected the 
change of clause. It is not completely clear that such is 
the best way to- proceed; reexamination will come as the 
program expands to handle more variety in uses of 
subordinate clauses. 



V/hat does it mean if they are not at the beginning? Some 
O relevant comments are in Chapter 8. 
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internal Scheduling 

Certain aspects of the internal scheduling 
algorithms are unusual enough to be of interest, I present 
them as illustrations of the. kinds of flexibility needed to 
facilitate the analytic task; rone are particularly 
noteworthy on their own. I provide one mechanism f or ^ 
indefinite delay of a task, waiting for some fairly simple 
condition to be met; a way of postponing a task until a 
specific time (the end of the clause), at which time it,., 
will receive special scheduling; and a way of postponing a 
task, based on complex conditions, to allow other similar 
tasks to proceed ahead. 

Indefinite delay ; 

The indefinite delay is handled by having a simple 
list of tasks to be done. These tasks consist of functions 
to do various search or checking operations which depend on 
factors whose time of .availability is unpredictable. The 
task list is checked after every major operation; functions 
stay until they have done their job and are then deleted. 
Por an example, each verb routine creates a simple task to 
determine whether its x^erb is the finite verb or a 
nonfinite one, a distinction which may depend on the 
presence of other verbs. (C.f. "He had run", "He ran".) 
Once that function has been able to make a decision, it may 
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set up a check on subject verb agreement, which in turn 
might have to wait until the subject has been identified* 
Some of the position checking functions are currently 
implemented as delayable tasks, while others have been 
incorporated in the PHRASEDIVIDER functions. The task list 
mechanism remains l.o provide, scheduling flexibility, 
particularly to facilitate the addition of new features 
which may well find their eventual home elsewhere. 

The operation of the task list feature is evident 
in the verbose example of program output. When a task is 
first, set up, the message "New Tasks:" identifies it. 
(Parameters are occasionally passed via "TEM-nn" variables 
to avoid impossibly long printing:)- I do not show the 
attempted executions, but when a task is done successfully, 
the indication "Task just donfe" appears. 

Definite Delay : 

The PKXIST provides a somewhat similar way of 
altering the scheduling, by postponing action until the end 
of the clause. Any search which must wait until the clause 
boundaries have been established can be entered on the 
PNLIST. This feature is currently shared by pronouns and 
separated separable verb prefixes. The PNLIST is separated 
froifl the task list to avoid confusion, and also to provide 
flexible scheduling (via the complex delay mechanism) when 
it is finally executed. 

ERLC 
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Complex Delay ; 

Within the executive used in following SDlIst 
pointers (and also for executing the PNLIST) is a JEechanism 
to allow arbitrary delaying; by any entry. The Si3LioT is 
arranged according to the structure as given in the 
original description- Words riay be encountered in 
different order due either to errors in the student 
sentence or anomalies in the implementation- In either 
case, it is useful to alter the sequence of execution, 
specifically by allowing a routine to delay its execution. 
That routine wiir be tried again after the next routine has 
been done; delaying is not limited until there is nothing 
waiting to be done but routines waiting for each other. As 
the scheduling changes," it will sometimes be necessary to 
scan forward in the sentence: JRUNWD is itself availat>le at 
the lower level executive, with appropriate safeguards for 
handling a change of clause or change of element. The 
scheduling mechanism, currently, used at less than full 
flexibility, allows a function to decide at any point 
that a delay is advisable; even chfangee in the global data 
structure v-an be automatically reversed when the delay 
option is exercised.* I anticipate great©- use of the 



*. I wish to acknowledge a certain debt to the M.I.T. 
Micro-PLMNER system <c.f. ref. 22.3; Winograd> for the 
notion of complete. restoration of the environment when some 
condition causes a routine to "fail". My implementation, 
Q however, is much less complex than Micro^PlANIffiR. ^ 
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complex delay feature as more complex noun phrases are 
implemented. Already, as I have des.Tibed above, it forms 
an important paxt of the pronoun searcher's strategy for 
handling ambiguous matches. 

A slight program trace appears on the fuH output 
to indicate the operation of the complex delay executive „ 
The identifier "THDO:" followed by a function name indicates 
the start of execution of a function which has the right to 
delay. 



PART THREE 



Error Analysis 



The following sections will discuss the task of 
analyzing words and structures once they have teen located 
and identified. After an introductory discussion of the 
general philosophy of errors which I have followed, I shall 
proceed to a detailed description of the analysis of 
inflection and word order. 
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Chapter Six 
ERROR ANALYSIS: PfilliOSOPHY 



The operation of the program can be divided into 
two areas: a "rcatch" portion (described above) with 
responsibility for finding words and constructing some of 
the framework of the sentence; and the "analysis" portion, 
yet to be discussed, which follows the matching process 
with checks on the correctness of the various words and 
other consructions. This checking becomes particularly 
interesting when there is an error. Wy approach to grading 
is. more general than a simple determination of right or 
wrong. Among the many facets are disentangjleraent of 
different errors from one another, and extraction of 
information pertaining to the degree of the student 's 
competence with the subject. I shall expand on both my 
theory. of error analysis and the way it is handled by the 
program, after briefly reviewing some relevant work by 
others- 
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Survey of Other Research 

There is little formal precedent for my kind of 
analytic grading; what discussion exists in the literature 
^ is maTji^Ly--xrieol^<i^t'i 



. .a J"-J,,;>«i J 
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An experienced teacher can easily gj'ade and analyze 
a student's answer (though he probably wiJJL not be able to 
explain how he goes about doing so). The teacher will also 
offer explanations for wrong answers: "that's feminine"; 
"She used dative instead of accusative there."* In speaking 
with several experienced German teachers, however, I found 
that they interpret student errors almost intuitively; that 
they had considerable difficulty expressing rules of any 
broad validity. I do not mean to preclude the possibility 
that more thorough investigation would result in more 
definite findings; for this research, I have chosen 
to concentrate on emulating the grading results, indeed on 
expanding them . 

When one refers to the literature on foreign 

language teaching, the teachers' lack of rigorous approach 

to error becomes less surprising. To quote one current 

researcher: 

When one studies the standard works on the 

teaching of modern languages it comes as a 

surprise to find how cursorily the authors 

deal with the question of learners' errors 

and their correction. It almost seems as if 

they are dismissed as a matter of no 

particular importance, as possible annoying, . . 

distracting, but inevitable by-products of 

the process of learning a language about 

vhich- the teacher should make as little fuss 
as possible. 

<ref. 4.1; Corder, 1967> 



* The teacher may well draw coKclusibhs which he uses, 

gerhaps subconsciously, to guide his teaching. 
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A chapter heading from one of those standard works: 

Foreign language habits are formed most 
effectively by giving the right response, not 
by mal:ing mistakes. 

<ref. 13; Rivers, 1964> 



Rivers goes on to quote Politzer: 

The real skill of the teacher lies riot in 
correcting and punishing wi^ong responses but 
in creating situations in which the student 
is induced to respond correctly. 

<ref. 11 ; 1960> 



(Note: I recognize that particular psychological theories 
of learning have motivated some of the above comments; in 
the current pro;ject, I implicitly subscribe to an alternate 
view of learning.) 



Yet there are examples in the literature of a 

genuine concern for errors. The language teachers have 

recently been wooed by Contrastive Analysis, which employs 

linguistic methods to identify difficulties arising from 

confusion with the student's native tongue. Independently, 

researchers studying the speech of young children relied on 

the presence of errors as evidence that the child was 

indeed using rules: v - ... . 

Even children who have never, studied the " ^ 
rules of . grammar make use of the' grammar ..of 

,, the lajQgusgev This ^ i in Jfche^m 

they mELke. When a child savs. Ee ^oed . he is 
forming a "regular" preterite on the pattern: 
showed , weighed , served : " goed ." His error 
reveals the that he has been applying the 
pattern even though he is not able to 
describe it. 

, <ref . 8.2; Lado, 1964> 
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My work attempts to apply similar reasoning to 
errors found by the program in the student's second 
language work. I believe that much "erroneous" behavior 
can be explained as more than just as a siE;ple error, but 
as a mistaken conception about the"&cope of a rule, about 
which rule to apply, or even about the exact formulation of 
a rule. 

Corder, whose introductory remarks I cited above, 
has some positive contributions of great interest, stating 
that: "our ingenuity should be concentrated on techniques 
for dealing with errors after they have occurrtid." 
<ref. 4.2; 1967>. He has taken the notion of error 
analysis from first language learning and applied it to 
second language work as well. He sets forth the view that 
the speech of someone learning a second language need not 
be regarded as improper formulations of that language; • 
rather, the speech can be considered as a language in its 
o\m right, complete with a grammar. It is a dialect, of 
course, sharing a core with the correct or "target" 
language; it is usually quite individual, and also 
imstable, as continuing instruction brings it more and more 
into accord with the target. (Corder calls the dialect an 
"idiolect".) Prom this point of view, "mistakes" are not so 



, much to be taken as wrong, but rather as evidence of the ' 
particular grammatical system of the "idiolect". Corder 
challenges the. teacher to detect these rules and use the 
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information to guide the student to strengthen or modify 
his understanding appropriately. <ref. 5; 1971> 

I share the general approach propounded by Corder, 
though I have not carried it as far as he suggests, I very 
definit^ly-agree-that the student's performance reflects 
the state of his knowledge, and have attempted to implement 
error diagnostistics on that assumption. (I would note 
that Corder does not explain how to go ahout doing what he 
proposes). That the state of the student's knowledge may 
be cast as a system of grammatical rules is a conception I 
am working towards, but have not yet reached. 

An Explicit Error Analysis Technique 

X should like to elaborate on the theory which I 
have developed for practical error analysis, before 
proceeding to a detailed discussion of the actual analysis. 

Requirements ; . 

I require several interlocking characteristics in 

the grading report: 

1. Rule-specific reporting. The student is taught 
specific rules of grammar. Whenever possible, his 
performance should be related to those rules, preferably on 
the level of individual rules, For instance, a report 
should not read "noun form wrong" but something more like 
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"'-eri' ending, required on this type noun for singular 
accusative and dative, is missing." (Bear in mind, this 
report is not necessarily intended for the student, but 
primarily for the teacher).. . ' 

2. Meanin£,ful error follov/up. (see, particularly, 
Corder ) . A German grading prograjr should be able to 
interpret a form ' 2 "der Buch" as a 'niistake in gender 
(substitution of neuter for masculine), or "die Buech" as a 
good try at a plural form (which should be "die Buecher"). 

3. ' Credit for partial correctness (a corollary of 
the rule-specific requirement). If the student needs to 
use two rules in a given spot ajnd only does one correctly, 
xxs should still be given acknowledge. en t for thrit much. This 
dictuHi applies at all levels; it is, of course, part of 
the reason for such detailed analysis of the sentence, the 
need to separate out errors from correct performance. Even 
within a single word, the notion of partial correctness 

may have meaning and should be pursued- Thus in "Der Buch 
ist alt" ("The book is old"; the German should be -"Das Buch 
... " ) , I would like to credit the student for nom inat ive- 
case" in the article (probably), even tiiough the form of the 
word "der" is not totally correct, 

-..,4- Strong and weak correct. The rule-specific view 
of reporting can be extended to include^ correct as- well as 
incorrect performance. Along this line,' it may als^- be 
pedagogically useful to know how much of 'a discy-imination 



was involved in the student's correct answer. Por 
instance, the article "derV in "Der Mann ist alt" ("The man 

is' 61^^* )" ' couM report ^?<i .directly as a correct . — 

application of the rules for indicating;; gender (masculine; 
and case (nominative) in the definite article= In manj' 
instances, however, there is such overlap of forms that a 
definite diagnosis is not possible. Thus the article 
"das", in "Das Buch ist alt" ("The book is old") indicates 
gender ' correctly "(neuter )7 but" in " indicating case* dofe's 'n'dt * 
discriminate between nominative and accusative. I will 
term this lack of discrimination "v/eak correct", in that 
one cannot satisfy a strong assumption of correctness. 

Descriptive Framework; 

In order to address the question of error in a 
systematic way, I have set up a descriptive framework 
describing the major steps in the sequence of rule 
applications leading to tlie final form of the word. I can 
then use this framework to guiv^e the rule-specific 
reporting. Meaningful error is then defined as a deviation 
which can be interpreted a. failure or misaplication of 
a rule somewhere in the framework. 

The simplest example of such a framework shows a 

threer-step process: cue — , intent — result. Por example. 

Cue: " Mann" 
. Intent: masculine (nominative) 

Result: "der" . 
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For grammatical items this three--step division represents 
the iDeginnine of a system which takes into account the 
existence of interesting mental activity between the cue 

and the final f era In. particulars, . pne v/ould Ijke to 

examine the ways in which the student will activate and 
apply various grammatical rules* 

Tiiere is no way to see the middle step, the intent, 
directly. One can, of' course, ask the student direct 
questions about the riiles he,, is using, and perhaps extract 
some very useful information. These quBst ions involve 
interaction with the student, however, and thus represent 
teaching activity in their own right. As such, one can 
evaluate the desirability for the direct que^ioning, but 
it is outside of the realm of the present project, i.e. 
extraction of information from the response alone. 

If it were' possible to see^ inside of the student's 
head, the grading method of choice might be: 

cue intent ^ 

result 

(I.e., major analysis concentrating on the steps from cue 
to intent, and from there^ to result.) There would probably 
also be a check directly from cue to result. All that is 
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available, however, is what is visible, and so the method 
must be: 

cue result 

(intent inferred) 

(I.e., moving directly from cue to result, and inferring 
from that what the intent might have been.) If the 
inference is of suffj-cient quality and detail^ then one can 
reconstruct the more desirable cue - intent step as part 
of the ansiysis. ^Suman teachers seem to be able to do this, 
thus providing the computer with a challenge. Someday, 
perhaps, psychology will provide a good theory of mental 
functioning to explain situations like the present one. 
Por thfi present, a black-box approach seems to provide the 
best approach. I construct an algoritiunic framework which 
can be presented essentially the same input as is given to 
the student, and which will produce the same result. Prom 
its performance, I then make inferences about the student's 
command of the subject. To avoid asiy mi siAnder stand Irg, 
the analysis is not done "by synthesis"; the program does 
hot blindly try to generate a. form similar to that used by ^ 
the student and then use its own actions for a diagnostic" 
report. Bather it attempts at each stage to figure out 
what the student has done,, using its information on what is 
expected, what the final result was, and what kinds of 
error possibilities exist. 



At times the program will be unable to nske a firm 
diagnosis of what the stud<!'T~t intended. The source of the 
diagnostic difficulty ma-.? - , ajn,.iinf^ on 
the part of the student, which the program could not 
interpret (other than of course detecting that the form was 
erroneous) o Or, in many instances^, a basic ambiguity in 
German gramnjar will foim an impediment to full 
interpretation; "weak correct" is a manifestation of this 
latter problem. In either case- the grader program appears 
to be in an advantageous position for not only reporting on 
its findings, but actually pointing to ways in which the 
curriculum might be altered to elicit a response fc^iat woul.d 
shed light on the particular confusion. 

.The following sections will describe the specific 
applications of the general theory of error analysis to the 

t. 

specific problems of inflection and word order • 
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■ • Chapter .Seven , , , , , , , . . , . . - . ^ ^ 
— - — INHJECTIOKAX-AUALYSIS - 



Inflections "The change of xonn that words undergo 
to mark such distinctions as those of case, gender, nuinher, 
tense, person, mood, or voice*" <ref. 20; Webster> 

German is a highly inflected language; accordingly, 
the word-match routines have been specifically designed not 
to be sidetracked by inflectional errors* In this section, 
I discuss the analytic portion of the pro^rain which is 
respionsible for checking inflection. 

Terminology 

In this discussion, the term "grammatical category" 
shall denote one of the set of distinctions: case, gender, 
number, tense, or person. An inflectional change shall be 
said to "indicate" the relisvant category- (These two are 
standard terms. ) Purthermore, I use the term "value" for 
the particulai- choice within a category. A noun, for 
instance, inflects to indicate the grammatical category of 
number (or more briefly, to "show number"). At a , 
particular instant, the appropriate value in that catej^ory 
might be "singular". y.v '---^ 
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_ In general, a particular inflectional form will 

have to reflect more than one grammatical cate£;ory* I use 

the term "slot" to denote a specific set of categories and 

their values, with which is associated a pai^ticular 

inflectional form* (A form may be associated with more 

than one slot.) Por example, articles must show case, 

number and gender; a slot will be a set of one value for 

each category, 3uch as: nominative, singular, masculine. 

^ -The -fallowing* abbreviatiojis are -used» thrOu£:i:iout — 

this report, as well as by the program itself: 

case: nom — nominative 

acc — accusative 
: dat — dative 

gen — genitive (not implemented) 

number: s — - singular 

p — plural 

gender: m — masculine 

n — neuter ' 
f — feminine 

In aziy category, may occur to indicate an absence of 
discrimination: >ill match any value. 

The slot associated with a particiilar inflectional 
form may alsc.be called its "characteristic". 

On occasion, 1 may regard the ijndication of a 
category as a "constraint" on the word in question, and 
speak of satisfying the constraint* 
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JFramevork for Inflectional Analysis 

The previous discussion on error philosophy and. 
rule framework applies most strongly to inflection. 
Accordingly, I have a framework; setting forth four ma;3or. 
steps: 

1 . What grammatical categories must be indicated? 

2. Por each category, where is its value determined? 

3. For each category, what is that value? 

4. What is the proper orthographic form to 

show all the values? 

(Note that "intent" now comprises steps 1, 2 and 3; 
the "result" comes from step 4-) 

/\n. example may be utseful. (for simplicity, I will 

omit the category Number.) Consider the definite article, 

"den", in "Ich sehe den Wagen" ("I see the car"). 

1. Must show: case, gender 

2« Governors: case: clause syntax, verb 

gender: from noun "Wagen" 

3, Values: case: accusative 

gender: masciiline 

4. Porm: "den" . 

Step 1 is so basic it is easily forgotten. The 

computer, of course, needs explicit instruction on what is 

required.. Generally the student will also have been told 

slTJ'ilar rules, which exist in his grammar book even if not 

o - , - , 

ERIC 
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in the material as presented the course. If he forgets 
the inflection comFletely, the error is comparatively easy 
to spot: an attributive adjective will appear without an 
ending, or a verb in the infinitive form, h'cre difficult 
is the situation in which one cramiiiatical category was 
forgotten, as word:.- which indicate more than one cate£;ory'" 
usually" do so with a single form, as in the above example 
"den"» (The alternative is illustrated by the English 
pattern: man - men - man's - men's which shows nlural and 
possessive indepeniientiy, ) Certain possible errors ma^ 
suggest a step 1 failure « The definite article, for 
example, iscften introduced as showing only gender, not 
case, usin£ its noiainative fore:.* Thus the student who 
forgets to indicate case might well iust reproduce this 
nominative form; occurring where the correct case is dative 
or genitive, this might point to the step 1 error. ^ 

Step 2 constitutes the formal statement of the 
"agreement" requirements (which clearly can also be phrased 
as a requirement that both - words indicate the, same values 
for the relevant category.) Viewed from a slightly 
different angle, this states the need for internal 
consistency. The verb, for instance, must ag^ee v/ith the 
subject of the sentence as written; if that subject is not 
the expected one, the verb still sliould show the internaj> • 

* compare the following statement from the Reference Grammar: 
"The plural article, for all three genders is. 'die'." 
<ref. 9.2; section C1 01 > 
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agreement. Similarly, if a preposition has been changed 
from the expected one, thp student must apply the case 
i^equirements for the pr^pbsi tion he has actually used. Of 
course, he should he credited for following these precepts. 

. i 

When a substitution by the student results in a 
change from the expected lvalue oi the v.'ord governing a 
particular inflection, the grader must be sensitive to both 
the expected word and/the one actually used. Agreement in 
form v;ith the expected word is a Ifind of step 2 error: the 
inflection rule 'requires "agree^^ with" the actual 'word. 
Possibly, however, the substitution is only in the form of 
the iToverning word, not its intent (i.e., a step 4 error on 

that .'yor<i'X;--t.i8-- G*-wd€Rt-teisfet- >ti&€R-i)s- •■using-M«s- intended ' - - 

1 

value as the source for the current inflectioriJ Such 

i 

guessing can become very trickj^ but the expect|ed value 

1 

represents one good hypothesis about what mightibe 
intended and is well worth checking. Another possibility, 
producing much the same result., is that the student learned 
the inflectional pattern by roE,e, but has forgotten, a 
middle step, the form of the word which actually governs 
part of the pattern. JFor example, he may recall that /"to 
count on" — "rechnen mit" — - requires a dative object, and 
use dative even if he forgets which preposition to use and 
substitutes one which itself requires accusative. 

There are not too many instances within the 
language itself in which the. student can be expect e^ to 
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make a wron^; choice of governor. Several distinct 

possibilities which do exist are worth noting briefly. 

. -The"^ gender - of a ..compound . no::An Is deterinined^by^-,. . 

the la^t component noun; 

—The verb in the form "Das sind xxx'' ("Tliat is xxx") 
a^ees in niimber with the complement, not the 
norcially-sin^Mlar subject "das"; 

— ^For certain neuter nouns which have "natural gender", 

such as "das Maedchen"^ pronouns will agree either 
with the grammatical gender or the natural gender, 
depending on the physical distance between noun 
and pronoun. 

In b11 of th^s,^^ cases , the pro^7;am caji certeinly^^b made 
sensitive to the possibility of step 2 errors* 

...^ . -ifeving 'I'ea^ the student just may not 

know^' which vc'^ue is appropriate, particularly when that 
happens to be a * n gender* I find the distinction 
between steps 2 and j> intriguing due to the prospect of 
giving the student credit for an inflection even if he errs 
at the point of determining the value for a particular 
graimnatical category. Notice that an error in noun gender 
is really a problem v/ith the noun, even though it happens 
to show only in the formation of the article^. 

At steps 3 and 4? the student must command a fairly 
large amount of information to perform correctly. In some 
instances, the information .constitutes a c3.osed set, albeit 
a moderately large one- (e.g., adjective endings, irregular 
verbs)* In others, the complexity of each inflection may be 
less ,:„but_ the size of -the set is unlimited (e.g. , nouns, 



gender and formation of plural). Lots of thin£;s to remember 
means, generally, lots of possible errors. I have 

accordingly de /oted quit e a bit of eff orjb Jio hand^^ 

3 and 4 errors. At the. same time^ I must add that much of 
the information the student needs is merely factual,., 
learned by rote, which makes for rather uninteresting 
errors. (I'd much prefer to chase an improperly learned 
rule.) Even uninteresting errors need to be caught. 

The detailed analysis of errors at steps 3 and 4 
has two ■ dividends beyond the basic necessity. of. .error 
detection. Both lie in the direction of sharpening the 
precision: with which the program can diagnose just what the 
student knows.. 

A) Pi'oper object of error: 

A real step 3 error belongs with the word which is 
the source of the constraint as much as with the word 
showing the improper inflection (unless, of course, they are 
one and the same word). Usually, the source word will not 
itself inflect to indicate the category' in question. 

* a- 

f .... 

j Thus a student who thinks that "Wagen" ("car''-)--is 
feminine will say "die Wagen" instead of "der Wagen" in the ^ 
nominative singular. As the error comes from the noun, it 
will also affect other constuctions like the indefinite 
article and adjective, endings as in "eine rote Wagen" ("a 
red car"), or ;grcnoun gender a^; In "Hier ist eine Wagen. 
Sie ist huebsch" ("Here is a car. It is pretty"). 
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On the other hand, the student's fcro "die Wagen" 
might be due to a step 4 error; i.e., not knowing the 
correct form of the ar tic3-e. Th en the e rror Indeed lies 
' with the article, not the noun. 

In an instructional system that tries to respond t 
the individual cyiaracteristics of each student, correct 
placement of credit for errors carries major pedagogical 
implications. 

B) Disentanglement to find out v;hat is correct: 

As noted above, the German -^inflectional system 
usually combines all the constraints into a single form at 
step 4* There is fio quality of the nominative masculine 
ending which expresses the nominatival as distinct from 
masculine in a way that might be expected to carry 
through tO;.- say, a nominative feminine form. A nistake 
in indicating one grammatical category will in general 
produce a form in which the correct performance in other 
categories is not immediately obvious, and it is only 
through systematic analysis of the possible steps in the 
production of the i'orm that the grader can properly assign, 
credit. 

for instance, the definite articles 
<dat p m> — "den" 
, <acc p m> — "die" 
<dat s m> — "dem" 

o-' * /' , ^ " ■ " ' ■■ • ■' ■ ■ ■ ■ 
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A "dem" where dative plural is expected will be diagnosed 
as proper case, wrong number; "die" in the same slot, as 
correct^'number but wrong case. But these diagnoses come 
not from the orthographic form itself, but from conclusions 
.gained by following, back to a point at which the various 
grammatical categories can be separated. In this 
circumstance, establishing meaningful interpretations cf 
errors serves a more important function than merely 
indicating how the student failed in one category; it 
a'ilbws "r ecogrii t ion , of ; a s im ul tar.eou-s • corrfec t pferf orm^ance in - 

another category- 

Unfortunately, things are not quite so simple. 
There is no a priori way of determining exactly what the 
student had in fflind (see above, Chapter 6). In particialar, 
any given error might be due to step 4 (wrong form) just 
as easily as step 3 (v.rrong value) or even lower. Each 
interaction with the student provides a little window into 
the state of his mind. Someday a p ogram might exist 
which would record all performance and thus be able to say, 
yes- Bill thinks that "Wagen** is feminine, so expect step 3 
errors and forms like "Hier ist die Wagen" ("Here is the 
car"; correct German would be' "i;.der Wagen'')* Some cf 
this student history might come from grammar drills, whose 
direct questions provide small but clear windows. The 
grader's analytic capabilities might well be put to use 



even before presenting an exercise,- to determine whether it 
would be likely to provide the desired a^nfOrination or just 
lead to a. dia^osi tic ambiguity. 

There are quite a few possibilities relating to 
errors which can be covered already, i.e., without^ recoiijrse 
to the individual student performance history 1 Some are 
from general errors as observed from traditional teaching, 
others from logical analysis of the gramnar and the 
rule-application foramewcrk. A [incorrect] form will either 
be a German^ inflectional pattern or- it will not. In the 
latter, probably less cominon casQ, -there is by' definition 
a step 4 error: the form, i-s not correct. Unfortunately, 
until the program has history information available, 
nothing more can, be extracted firom-an unrecognized for n:; it 

is just ,wrong» . . ' - ' ,c 

^ • • '"^ss " ' . ■ • ■ ■ . ' . i 

If the inflectional form is recognize^, perhaps a 

simple explanation can be found - to explain an' error. Much 

more detail on this procedure follows below* ^ General " 

principles inc4:Ude the following: ' . 

a) If the mesmingful exp)lanation postulates an 

intent too far away from the expected values, i would 

incline away from a step 3 explanation, ^proba)?ly towards 

step\4 except for the special circumstances discussed 

above N^I^'d not ^expect, . for instance, a genitive form where:; 

nominative belongs* *' ♦ * " 



b) If the governor of the inflection can te , 
identified as likely to cause error/ a bias can be 
introduced in favor of * that (^step 3) e:xplanaticn. 
English-speaking students, for ^instance, tend to be quite 
prone to forget noun genders. ' ... 

c) If other words involved in an inflectional group 
are themselves inflected, the extra inlormation can be used . 
to throw light on what the student had in mind. This 
situation rapidly becomes quite complex,, see discussion^ 
below* ~ \. ■ . • 

^ Thei^e exists one very bad complication: the 
iricompletenesa of the inflectional system. (Incomplete - 
in a mathematical sense.') Some forms serve for more than 
one value,' which results in a weak indication of the 
category in quest ron. (eog.-, only masculine articles 
distinguish' between nominative and accusative case.) Also 
some ^orthographic- forms appear in ^everal, apparently \ 
unrelated slots. ,("I)er", for instance, serves both <noin s 
m>, and <dat s f>.) The result is an inherent ambiguity in . 
the process of reversing the formation r^es (step 4). I 
have worked out varioLis ways of coping with the 
ambiguities, ways which are valuable as illustrations even 
though lacking the specific student data which' will clearly 
be needed to achieve the full potential of the grader 
system. * . 



DetejnniTiin.g What tiie Student- Intended 



Three Jlethods 



I have explored a n mnber of approaches to the 
problem of arriving at a reasonable guess as to the' 
student's intent. Two ciethods have been implemented; a 
third proposal may be even better. .Briefly, the firs't • 
'involves built-in tables giving intent hypotheses' for all 
coinMnations of re^iponces and Expected pharacteristics; the 
second method is computational, choosing among alternative 
explanations on the basis of :their "distance" from the 
expectation. The third is also computational, working with 
conditional probaMli ties ^ related, to^ the student 's • ^ 
performance. > ^ 

1. Tabular Method: 

The fir^t approach,^ the tabular, is currently 
implemented for article-noion inflection analysis; it is 
responsible for 'the comments in the sample runs- This 
approach : employs a direct table lookup to produce an > 
estimate of what the student intended- The two input 
parameters for the table are the expected slot and the 
observed forin^ Pgr each combination, 'tlie table gives one 
or more slotsr, possibilites. that the student might have, 
intended. V/ith each slot is 'associated an indication of 
the level of confidence I place in the particular 



prediction. Por ex^mpl^, consider the ar'ticle ending 
«--er", which is- correct for two case^gender slots, <nom s' 
m> and <dat s f>. If this ending occurs in a :situa-tion in . 
which the expected case, is accusative, the student may have 
one of three slots in mind: <nom s m>, <dat s f>, or 
perhaps even <acc s in>. Although the latter s never 
correct for "-er", it would-be consistent with the i^ttern 
of. neuter and f emir ine endings. ' • ' , 

For a masculine singular noun, the table entries- 
are as fallows i ' ^ 

' <noDi s m> "medium" 
<acc s m> - "high" 
<dat s f > - "low" 

where "medi^am", etc.',. are the level of cbnfid^nce 

indica:tors for each slot, given on a rough soale. 

' If the correct noun gender is feminine, still 

expecting accusative, the entries are different: 

Xnom s m> — "none" • . 

<acc s in> - "med" - • , , 

<dat s f > - "high" ^ ' 

V For masculine (expected), then, the best guess is 

that the siTudent intended <acc s m>, with "higl:i" confidence 
in the guess. Interpreting according to the generkl . - 
four-step fscheme for inflection, 'this represents correct- 
intent but incorrect form (step 4 error). For a feiffinine 
noun, a "high" confidence guess of <dat s f> means an error 
in intent (step 3) combined with a correct form.. • , s 

* : A complete.present^tipjfj^f the table, with fixrther 

explanation,' comprises apipendix 



Evaluation of method 1 : As currently implemented, 
the taltular methBd fias both good and bad qualities. I . 
. desired a demonstration of how intent— guessing could aid 
the grading operation. The table has been successful in 
providing plau:^ible guesses to illustrate both parts of the 
concept: immediate diagnosis, and the further analysis 
possible once the data is availajDle.* 

" The table has a very ad hoc quality; gra;mmatical 
facts aside, al^ the plausible alternatives and 
particularly the levels of confidence are my own creation. 
Actual values are necessary for the demonstration; 
presumably the various entries can be adjusted as ^ 
experience is gained abdut^hat students actually do. 

The tabular aijproach has the advantage of . 
providing very sipecific control of the various , , ^ 

alternatives i Since each entry is given directly and 

"•■-■* , * ■ ' . "■ * • • * 

individually, there is- no need to juggle numbers in order 
fo have two interacting slots both come out^ correctly. 
(See/, particularly, the following method* )* This 
specificity is at the same time a most severe disa?! vantage.^ 
Ihe table does • not contaiii any. direct rjelation to general 
Instructional parameters such as "student tends to use 

feminine instead of masculine**; thus there is no obvious 

■ ■ . ■ - ' - ■ ' ■• V ■ ' ■ ■. ■■■■ - ■ ■ ■. ■ ^ • ' • " • --.>'. 

way for properly incorporating feedback. Purthermore, the / 

table is noncomputationfiLL, living no direct way of 

expanding it to coyer other variables, such as word type^^, * 

or to handle, interaction with other in: ott i .i/ords, . 



" id' ^ 



. ' To meet some of these objections, the table was 
designed to provide^ more than one possiblity (intent-guess) 
per entry, with varying probability/confidence levels! In 
order to reflect the ' influence of other criteria, the 
coRfi'derice levels could be systematically. altered. The . 

table itself specifies som^ case-gender entries merely but 
referring the other entries with directions to. diminish the 
confidence levels.. No outside- use has yet occurred; it is 
the nature of the table to want to specify everything- 
individTjialljrrather than by broad classes. . . 

Another shortcoming of* the table stems again from 
its very specificity: information beyond that explicitly 
present' versus just not accessable. The lack is particularly 
galling when the variables- I need were themsel\os involved 
in my deliberations when setting up the table. The data in 
the table does not suffice for a determination of whether a - 
form is indeed correct; that finformat ion would have to be 
added, albeit at minimal cost. Some distinction between 
strong and weak correct has been^ incorporated in the format 
of the guess specifications. If . a form covers two values of 
a category, both are included in the slot specification, as 
in <(nom acc) s f > for "die". But that information sti'll 
applies only to the intent-guess, not to, what actvially is 
correct in German grammar.' The latter could also be added, 
. or even obtained computationally, once the correctness * 
information is added. - . \ ^, 



Yet another frustration occiirs when the; dia^osis 
. is either ambiguotts or conf use\i.' The grading program ought . 
to be able to Took atM)hat has confused it, and communicate, 
that- infonnation to guide 'the teacher in producing a - 
modification of the exercise which will elict a more 
informative response. Again," the most' convenient course is • 
to expand the table to include the necessary information 
explicilly. "Philosophically, I disagree with the general 
approach of adding everything directly < to the t*:.ble. If a ° 
"systematic procedure is involved, better that the program, 
should be expanded to include the, pert irient ^gorithm. 

f . * * 

* ■ . K ' 

■ ^ _ 

P. Distance Computation Me thod; • • 

The second method, the computation of a distance 
metric,. is currently employed during the pronoun search. It 
was also used at ,ohe time for article-noun inflection and 
a modified form is still operational for verb ending 
checks . . This technique . involves, generating a metric In the 
space of grammatical alterriatives, so that a "closest" , 
y alternative can be chosen. When applied to a^ single 
v^iable, this technique is very straight forwiard; exact . 
matches are closer than mianatches* Thus,^ to continue the 
above examples xisins article inflection, the student might 
provide "das" when expected to produce a nominative neuter 
form* This form is correct for both nominative and 
accusatisfe; the former, being cprrect,' is closer and thus 
chosen. \ 



A finer rajnking is often feasible, even with only 
one variable. For instance^ -the student may use the same ^ 

. " - . ' • : ■ ■ - " ■ 

;^das"-when dative case was r,eq;uired. Kow neither case 

• - . - ■ 

matches exactly, but dative is "closer41 to accusative than 
it is to nominative (both he^ifi^ object cas^), and so a 
decision (cuess) could' be made for' the accusative. 

This method leads to a sharp dichotomy, which was 
evident in the last example. The choice of the closest of 
the characjieristics associated wih the actual response 
(accusative, above) provides a guess 'of . what the, student 
"intended assiamin^; that no step 4 error exists <i.e.^ that 
the orthographic form adequately reflects the intent). The 
characteristics associated with the expectation represent a 
viable alternate guess, under the assumption that there is, 
an error in the- form itself;^ The program is in a somewhat 
1 embarrassing situation, -with no"vay "to make -the choice and, 
moreover, .no way to find a middle ground. At one -stage of 
experimentation, I simply had the program carry along both 
values for further processing. 

When two (or more) variables are involved, the 
distance metric becomes more complex and also more 
interesting. In simple examples, the number of changes 
plays a .princiipal role: the distance is directly • 
proportional to the number of >gi;iables (or categories) 
involved whose values differ in the two terms being 
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compared- Thus, with an expectation of <dat n> and an 
actuial tdrm "der" .- <noiJi in> & <dat f>, the analysis is: 

•<dat n> to <noin. m>' : 2 changes, case and gender, 
<dat. n> to <dat f> : 1 change, gender ' ^ 

and thus the second is closer". The plot thickens if 
there is more than'^ne possibility with the same number of ^ 
changes. ..Change the example to an expected value of <dat in>: . 
^ ■ <dat m> to <noin m> : 1 change, case ■ 
<dat iD> to <dat f> : 1 change, -'gendei* 
Here a system of relative weights is needed to complete the 
choice. Case is probably ir.ofe significant than, gender, so . 
it can be given more weight (i.e., a greater di-stance is- 
involved if case must be changed). Then the guessed intent 
in the last example will be <dat f>, chosen because it is 
closer than <nom m>. • • y 

This method is quite appiealifig. It is simple, 

yet easily extensible to handle additional grammatical ^ 

■■ oategorie^s; it is computational j and it makes iise of general 

observations of grammatical performance. Hot surprisingly, 

this is the technique mentioned in .the literature the few 

instances where eiror interpretation- has been considered. 

Halliday has this to say: , 

I It- is importairt t^o note that a given en^or *. 
can be often be described 'in two or three ' '.\ " , 
/-ways, to each of .which -corresponds a • ^ . '\ 

different step that could -be taken ta correct 
it. Jot. example, /"he aslSed a new, book' coiAld' 
be corrected either to 'he asked for a new , 
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. "book' or 'he reqbested a r.ew book'; the§e 
will lead to two different aru^yses of the 
error,' in tliis case as it happens at 
different levels: the one grammatical, the > 

^ other lexical. Both analysies are valid. 
. Descriptively, the analysis yhich 
yields a simpler correction^will be 
preferred. , Asked for' anti 'requested' are 
each minimal corrections^ in the sense that^ 
each requires; only one step; but 'requested' 

" involves a change of register and might 
therefore be inappropriate. 

\ <<ref. 6; 1964> 



Evaluation" of method 2: 'l^either- Halliday i^or 
anyone else gives any more ^specif ic guidelines on the 
weighting of different errors. In trying to set up. a 
working version of the scheme, I encountered some severe 
siiort comings. I have already mentioned the lack of a. 
middle ground, a compromise between the two extremes of 
expected and actual values. By afssociating- initial weights 
with, each actual ending/valiie pair, it should- be possible 
ta introduce plausible error a3,temg;tives into the basic - 
dsta set, .just as in the tabular method^ the initial- . 
weights • would be such as tp place these alternatives at a • 
disadyajitage except under cer^^sain error coarbination^. 

In the end, the sheer we^^ of all the different 
parameters involved with nouns and ait^ticles caused the 
procedure to become unmanageable. As the nvmher of 

variables increases, so dpes the necessity of retaining 

? V ' ■■ • _ , 

unattractive terms at intermediate steps in the computation 
on the chance that they coiald later become significant. 



I round the algorltiim required an excessive amount of 
thrashing to accomplish even minimal goals. Either there 
would be'' top. many final possibilities or one that I was 
interested in was beloiv the cut- off pqint. Of course', this 
behavior could be due to bad tuning, but then tuning itsel:^ 
represents a major problem. The particular way in which . 
en<3lng patterns interlock is such that mucji of the 
supposedly general analysis is really oriented towards 
special cases. (.Per instance, "der" is the only form 
appearing in two fiAlly separate entries, <nom m> and 
<dat t>). The relative weights used for the distance 
Calculations, .which were initially ad 'hoc values, had to 
be ouggled time and again to get the special cases to i 

turn out as desired^ With each adjustment, the parameters 

(-■ ■ ■ ' . 

moved further from the world of the students 



Pronouns : For pronoun checking, a more restricted 
domain has led to a moderately successful implementation 
of this second method. The distance. metric plays a> vital 
role in the actual i!ia.tch/identification step,, enabling the 
search logic to chose which of several candidates is closer 
to its target* (Por a fuller discussion of the search 

strategy, see Chapter 5») . 

'J ' ■ ■ , 

Pronouns ii:i.icate five grammatical categories: 

person, case, number, gender, and type (the latter 's values 

including personal, familiar-personal,'^ and demonstrative). 



The algorithm' assigns a inetriq value froifi 1 to 4 to a match 
^^tween two slots. Comparison is aGne:left to right in the 

order given above, discontinuing wherisa mismatch occurs. 

Specific value assignments are: \ . 

1' — when -nothing patches (mismatch on person) 5 , 
2' — when only person matches (mismatch on. -case); 

3 when person and case are okay, but number, gender, 
or type do not match ;^ 3 is also assigned if, the 
case V mismatch involves an interchange of accusative 

' ^ and Native; 

4 - ' when all five 'categories match correttly. 

Uncier some circumstances, a second metric is needed 
,to try to resolve a distance "ambiguity i.e., when two 
^comparisons yield the-- same value - by the previous algoritlun. 
The second metric is arrived at by checking all five 
-categories, and^ counting the number of mismatches ^without 
,,usihg any weighting function. ' ^ - 



Verbs : For checking verb inflections, the simple 
coFiparision scheme has been quite satisfactory. Basically 
a yes— no technique, it checks two slots for intersecting 
"specifications and returns either nil or the ^ intersection 
value. A fuller discussion of it^ use will be found a bit 
later on. ^ . - , > 
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3« Probability Computation Kethcxfl : 

The third method, ^ using probs.bilitii^s, represents 
an attempt to inc or iXDi*ate the bet t«^T- points, of the two 
'previous methods. It has be' in fair detail, . 

but has not Vet been , prbgraiL,. ,ie following discussion 

unfortunately does not have the force of exp^ience other 

' ■ , ' . ' / / - . *• 

than a little hand Simula tioii; The technique is -grounded 

■ ■ ■ ,. ' ' ' ■ 11^^" . - -. ■ . ' ' ' ' ' . ^ 
in the simple three-step, theorj' - of hpw the student .goes 

about generating an inflectdonal form: he receives a cue . 

(expected value) , derives an intended value, and from that 

produces the actual* form- With each step, X associate a 

probability measure: for a specific cue, the probability 

of a specidTic intent; given a specific intent, the 

probability .tliat a certain ending will result. Several ' 

useful predictions result from simple combinations of 

these two basic probabilities. 

.'Let: ' " ■■■ .. 

~ P( intent [i]) be the protiability that the student intends 
,; a case-number-gender slot I; . , 

P(foiTi[j]| I intent tl]) that he produces form[ j] given 
tends slot I.- 



that he ir 



The product bf the two gives: . . 

-;=^(form[j] & in\ent[i]), the probability that he will, 
•produce the right form f^^ the right reason (intent): 
ERIC ^i^ormU'] & ihtent[i] ) .= P( f6rm[ j] | intent [1] ) * P( intentCi^) 
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The probability of prd&\ucin^ a form regardless of intent is 
the sum of all the applicable, terms: , 

P(form[jJ) = sum(all- {P(forin[ j] I iTi[tent.[i]) * PCintentCi])} 

. ' " ■ . ' ■ ' ■ "I 

Note that the expected case does not follow any 

probability distribution, and so the P(intent[i]) is not 

strictly a conditional probability even though referred to 

a given expectation. Different values will be needed, . 

though, for each expected slot. Additional conoiments on - • 

this potint will be found in appendix C, along with a^ 

complete listing of su£tges ted probabilities, \ 

In this terminology, an intent-guess corresponds" to 

an intent [i] given j3ot^ 

form[j]. The probability expression is 

P(intent[j] I °f orm[i]) It can be calculated (with the 

available information) using Bayes Law: . ' • 

P(intent[i]. I form[ j]) = ■ ' - 7 

P(form[j]''l intentCig) * P(intent[i])^ ^ . . ^ 



sum (all k) {P(form[j] I intent[k]) * P(intent[k]} 



FRir 



The program need only make the computation- for all relevant ° 
ixitent[i] slots, and chose the one with the highest 
probability. As. a given form will not have any 
signf leant probability of occurrence in any but a. -few 
slots, the calculations need not be excessively - longi _ ^ 



Some examples may be helpfiol. 
P( intent): 



,,If the de.sired case is nominative^ one would hope 
that the student would intend just that, but there is also 
seme c&ice that he will chose another case. Assigninfi 
specific probabilitiqE ' his sitiAation/- P( intent ) , gives 
the following table. To in forming; a. concrete 

demonstration, I have also assigned, ad- hoc, numerical " 
values to each probability » . ' ^ 

Expert: Inominative 

Intent: nominative, P(nomaiomV 92 
accusative, Ptaccnom^ = .05 

. _ — — --^ dative-,— — — p(datncan)- = .03 — -- — 

,■ ■ . : J^- \ ;^ 

P(form[ j] I intent [i]) : 

•Given an intent, what is the probability that the^ 
student will produce a given ending? (This is a true ^ 
conditional probability.) Again associating specific |' 
values for clarity, here is a partial table for . • - >. 
P(form[^]* I intent[ij): 

■ Intent: nomingitive masculine ... " 

Portfs: "-en", p{en|nom-mV = 

"—eni", ir'iemlncaa— m) = *05 
«-«er", P(er|nc»B--ini) = .10 ^ 

/•_ . . ■■ /■ ' . ■ ■ ■ 

Note that anticipated errors, must be included 

expGLicitly in the forms table. 



. ■ * = . . * " • . 

P(form[ j]) : ' ' 

By combinijTg the two atove probabilities, it i^ " - 
possible to predict whether the student will come up 
with a i)articular ending for a. particular" reason: 

P(form[j] & intent[i]") = P(form.[j] I intent[i]) * P( intent [i]) 

The pr-^' -i-,: .gj^ be expressed for any situation. 
Thus^ ±'or xnstance, if the expectSition were acciisative 
masculine, the correct ending would be "-en"; the chance / 
that -^e student would in fact give that ending is mainly: 
P(en & acc-m) = P(en|acc-in) * P(>acc) » 



The student's intent is not visible in his final 
performance, of covirse, so the probability of actuary 
producing the correct form wiH' include terms aceounlsing 
for other ways it can /be produced, raising the probability 
figure somewhat, ^ ^ 

P( intent [i] I fonii[j]) ; ' . - - 

The analytic problem, which is my main interest 

here, is just the reverse of generation.' The expectation . 
remains and, in addition, the student has actually done 
sdm^ thing. The task is to arrive at an estimation of likely 
intent- The Bayes .Law -f ormuia above permits a calculation 
of exactly what is needed, . , ; 
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Consirler er" used with an expectation of <acc m>* 
The conditional probabilities for er" in the masculine 

are : . . .. , " . " ' , - ' . . 

- i { er Inom-iD^ =: .95 

e:riacc-iiiJ ==: ^^^O ^ 
* P(eridat-iii) = .05 ' 

and the direct proi)abi3,i ties, for each of the cases involved, 

with <acc> intent, are: * ' . \ 



^( ipj6iaacc ) = ^ O4 
pyaccacc) .90 
p(datacc) = .06 



The calculation of int^t is straightforward, First the 

numerator portion of the formula i^: y 

>/ P( intent [i] | f orm[ j ] ) = PCformf j 1 { intent [11 ) * P( intent [ i ] ) 

• T^fnonh-iiiler) = PrerlnoEi-m) * P(nomac^\= .95 * -04 =# .0380 
P(acc-mler) = Plerlaco-niJ * P(accacc) = .10 * .90 = .0900 
P(dat-m|.er) = P(er|dat-m) * P(datacc).= .05 * .06 = .0030 



■sum '= . 1:310 



The denominator of the Bay^eb liawVformi^^ a' noriaallzing 

factor, the sum of all the ni:imerat or terms* Carrying 

out the normalization (divide^each by 0* 13^10) gives: 

- Prnom-mfery = .29 . 
\ Place— ml er 5 = .69 
: P(dat--mjer) = .02 " 

Therefore the l?est guess is that the student 
intended .<acc iil>, with <nom m> consider,ed somewhat less 
pi-obabie . ( The full calculation would show terms for 
gender and number, and include a figure ifor <dat f>- 
See appendix C for more details.) 



Evaluation of method 3 z The numbers are still 
drawn from a hat and are subject (at this stage) to minor 
tweaking to make things come out right. Still thei^ seem 
more closely related to plausible student behavior than . 
'were the numbers in other methods.' Moreover,, their scope 
•is wider than just the intent-guessing problem. Of general 
applicability, the probability estimates may be used in 
'Selection of curriculum as well as in grading; performance 
dat^ from any source can be ddrectly related. inl. adjustments 
to the estimates. The nature of the variables is such . that 
modification through feedback should ; be relatively / 
straigtitfprwaxd. ^imilEtrly , it will be easy 'to introduce 
a bias to reflect a changing curricular environment. For 
instanceV al discussion of dative prepositions may • 
temporarily change -the relative accusative/ dative bias. 
Which just means incrementing the dative probabilities at 
the expense of thbse predicting accusative. 

Additional Considerations ; . Another major benefit 
inherent in th^ probability technique relates to- the - . 

question lofam^M ; 
perf brmance criterion. To review, this refers to a 
situation, in which a particular inflectional form correctly 
expresses more than one value of a particular grammatical 
category. Clearly there is no way to be certain which 
Vjalue -the student had in mind when he used the form; the 
notion "weak correct" indicates that "he is not wrong, but ^ 
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"may not be totally right either. ' An extension of tlie 
J protability method, provides a rather natural way to catch 

these, finer points. The extension Is quite simple; it 
Centers on a consideration of the symbolic, as well as 

nunfer ical , pr obabili t ies * 

A weak correct situation will be easy to identify. 
-There will be, of course ^ the most probable value: the . 

program's guess af the student's iritent. There wg[l± *. 

second , term , another possibility, with two characteristics: 
.1. the endings probability, P( ending I intent), will be 

fairly high, indicating a correct entry; and • 

■.■ , .. ■ , ^ . <■ ', ' . 

2, ;J^he symbolic expression of probabilities will differ ^ 



in only one term from the .expression . f par/-, the selected 
entry. 

The ambiguity responsible for the weakness- is 
precisely in t'le graimmatical categorjj associated with th^ 
differing term. An example may help clarify the operation: 

"-eV is correct for femifiine singular, both 
nominative and accusative case. 

If the expected caise is nom^iriative, . the ending 
probability terms P( el nom~f) arid P(e |acc-f ) are both near 
1.0, ^ they represent coirect 



<ndm s f>: >P(e|npm-f) * P(nomnom) * Prsainegen) * P(samenum) 
<acc s -f > : P(\e j apo-f ) * P(accnom) > P( samegen) * P(sajiienum) 

-r---only the case terms are different. — * • 
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If the expected case is accusative, exactly the same 

ending probabilities appear. Again, only the case terms 

win be different, but this time such as *to bias^he 

qihoice towards accusative: 

<nom s f>: P(e.jnom-f) * PrnomaccJ * PTsainegen) * PCsamenum) 
<acc s f>: P(ejaco-f) * p{aceacc) * P(saine^n) * P(s£iiaeni. 

Since P(accacc) and P(noiiihom) reflect correct ^ 

choices, they -will greatly exceed -their counterparts 

P(noinacc) and PCacCnoicO which do not. As the case terms 

represent the oi^Ly dlMerence in tme second part of the 

expression, the Mmerical. values will' favor nominative if 

iiojninative is expecteds. accusative if jthat^is expected.; - 

The change is due entirely to the changed expectation; the 

performance is precisely weak correct and cannot be used 

in feedback as" evidence tbat the stMerit can correctly make 

the particular distinctioh involved. ^ ' 

'"^ . ■■■ ■ ' " '\ . ■ ■ ■ ' 

^e full strategy for handling giamma tical 

ambigiiity includes not only detection, as just outlined, 

bia* a mechanisins^ put of the difficiO,ty. 

A change in ti^ expecljed^end^^ numbei^^ll b'J'ten 

result in an exercise im whieh^^ t^ desired grammatical 

distinction can be observed, _ performs 

both sjteps, detection and recommendation. - 
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Throughout the foil owing description, I.wilf 
interpose examples in brackets, taken from the follov/ins , 

data: ^ . ? 

. <'-.e«' is correct' f -r feminine si'neulajc'* both 
, nouiinative nd accusative case. 

Jf the expected case is "nominative: ' ' . 

<noiii s f>: P(ejnoiii-^f) * P(noinnom} * Pfsameeenl * Pfsamenuiny 
<acc s f>: P(e|acc-f) * P(accnoiE) * P(sair,e£];en) *- PCsamenumJ • 

1i Represent p-fobahili ties' both symbolically^ and numerically 

2. Use numerical values to select most probable alternative. 
. [ <nom s f> will win] 

^ 3.' Examine -pther' possibilities for both: 

3a. high -probability for; ending itself 
; lWe^<^^ = .95 ]' ■ ■ ■ 

3b. only one term of symbolic expression, for P(i^^ 

different from expression for most probable alternative 
. ' ' • [ ,P(nomnom) vs. P(accnom) ] 

. 4. Consider the grammatical category represented by _ 

the differing terms: this will^ show the location, 
,^ of the ambiguity. . -r 

* T case: could be nominative or accusative J 

V .- . * ■ L - . ■-• . . ^ 

(determination of a better exercise) 

5. take the specification for the most probable , ■ 
• al'ternative, and change the value for a grammatical 
category other than the one involved in the 
ambiguity. 'This forms a new specification, 

[ <nom s f> : try changing gender to "m", 
:\ -.-.-v.- ■' givingr-i:<nom s m> ] , 

. 6. rind the most probable ending: listed under th.e new 
specification. - ' 

[ "-efi^^,. Probability .95 ] ' 

■ ^. Alter the new. specification in exactly the same way 
as involved ih the original ambiguity, - 
■' [ '<n6m s m> : becomes <acc s m>]' 



£ , ■■'^"Examine tlr^ probability for the endirc of step 6 
•v giv^n -cification ari^i\ i at in £tcp 7- 

or\'&Gp-in) = .10 

eaV If i,t is hi^h, then the ziair.c ajrbic^^.ity ren?ains;, 
■' return to step 5, niakinx^ a differert change. 
"• ' .. '[.v/ould happexi in neuter, 

" . as P(eElacc-n) = 9!? ] 



at 



If the ending 'pirobability is not hi.^:^!, then the • 
al^oritl-mi^s successful. A neu exercise conforinin£. 
with the new specification will yield a situation 
in which the current air: bi^uity in the student s 
taerfor3T:arice can be cleared ^up. , 
[ so next time try usins a nasculine noun,. 



aeneral evaluation of the._LhreLjLg.iilQis.' '■ / , , - 

■ '. In conclusion, I feel, that^ the probabili-ty riethod 
offers great promise of . being' the: best of - tiie three methods 
considered. " Accordingly, I have placed its ii^pleken'tation 
high of my list of needejd ikprovenients to the prograrn. 



■ .-'.The foil owing 'topics ' in inflectional analysis 
remin to be '-discussed, in .the order given: 
— . Viord, "type'.' as an ■additional graiuiaatical category 
•— Mioltipie^wprd . inflectional patterns 

Verb ending checks (vjhich provide a good "illustration 

> ■ of the other two)'^T . • , ■ ; 
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Word "Type" as an' Additional Granmatical Catefr6rv 



A basic addition to' the program's representational 
scheme for inflection is motivated by ^the existence of 
varying patterns of endings for some wferd classes.,, The 
article • "der", as^ has teen noted previously, represents 
both the <nom. s m> and <dat s f> slots. However, its 
parent, the article/adjective ending "-er" is limited to 
."dies- words'* for the <nom s m> slot; "eir— words" normally 
take a null ending in that slot. To make the 
representational scheme uniform^, 1 have found- it useful to 
eyj:B^ of £3:^ammatical category to include, a^ a 

category, much like case or gender, the "type" or "class" of 
the word. By so doing, I am able to represent an . ' 

inflectional form inflectional form purely by the values of 
the grammatical categories it indicates 'without haying to 
append special conditions. Thus, in the previous example, 
th^ "-er" article/adjectiye ending couj-d be considered as a 
possible representation of nominative masculine for a 
particular type of word. Its use on an "ein- word" could 
be considered by. the error analyzer as a possible" <nom s m> 
form with an error m word type,, using very sdjptilar logic 
1}o' that needed to detect a mismatch of case or gender. 

One must, of cotirse, be carefijl not to establish 
categories which- do not have a parallel in what the student 
knows,., or the computer will be mislead into analyzing 
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partterns which do. not exist. The creation of new ^ ' . 

categories is done when necessary to allow the 
•representation of slots to be ^ clean and orderly, but also 
' v/ith care to see how the addition relates Ijack to what the 

student is tau£ht. J^ollowine that gTaidelire, I have, set up 

additional categories which I^hall describe area , oy area. . .. 

The specific identifying .numbers accoini.:anying the 
'lieScription are mainly for internal reference of the 

program. * 

Articles: : ^ . 

= • J?'or articles, a Tyj^e "category to distinguish 

between "ein— v/crds*' and^ "dies-words": 

1: ein-v/o37ds, no endingVin.ncminative masculine, i 
nonjinative and accusative neuter. 

' ■ ' - \ ^ 

2: dies-ivords, . er" . in nominative riaseifLine, '^-es*' in * 
nominative aaid accusative neuter . 

/ 3: -^eithe/^at student option *^ 

Complications (not. yet implemented, except as rioted) : 

• ■ ■ ■ ■ * . ■■ ' ^ ^ . - ' ■ , ■» ■ 

* "—es" becomes effect ivfgly "—as" for the definite 

' article <noiii s n> ajid^ <acc s n> ("das" instead 
of "des"); , . / ^ 

, • ; ■ ... 1,.,:,,. ^ ■ 

"ein--"' takes type-2 endings in the special form- 
"was fuer ein-" ; • . 

'■ ^ ■ , ^ - ■ . " . " ■ ' ■ ' - ■. 

• "ein-" has no plural form ( implemented) ; . . 

- adjectives lose many of the saine endings, but also 
■ have ajLother set, the "weal^" endings, with ^rather 
complibated rules for^ deciding whicl^ set to use^ 



The actual numbers are really octal, corresponding to 
single bits, to allow logical— "or" conditions on various 
gj^l^ tests land s^ 



Nouns : ' . ■ 

For. nouns,, a Type catecory to distincuisfi the 

varyinc inflectional patterns, chiefly in formation of the 

■ ' • .0 .. / 

DluraJL. , . ' ' '1 

Some. texts classify nouns according to the 
pkrticuiar ending used for the plural as a means to 
reir.emberdn^ which endins to 'use. That, particular 
classification is cf no use here"; in reneral the choice of 



eating is ,essen,tially arb.itrary and ^best presented to be 
learned by rote. .The proa^am jnay -well note tliat the" ^ 
student has , selected' a wronf, alternative pluraJ. enqihc as > 
distin£uished from a hohe^^i^^^ one, , but that dia£^osis 

does not require a graiKmatical category. 

There is an interesting interaction, however, 
between the particular ending and the specific kinds of 
inflectional changes which will occur. Por example, most 
nouns. roust add an "-^-n" ending in the dative plural; but a 
:. noun whose plural is formed with an "-n" will not show 
further inflection for the dative plural. •: Kufner 
distinguishes some rf5un class^l' oh these criteria <ref. 7.1>. 
^ Even though the distinction does not. appear explicitly in ' 
common texts, it fits- right in "with the . way I have. employed . 
grammatical categories. In order to know which inflectional 
slots to associate with a particular form, the program must 
. 'kijow which, inflectional pattern is involved (and worry about 
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whether, the student knows it too). Accordingly, I have a 

type category fpr nouns, -with a six- way divisiorj: 

(1&2: Nouns which do not inflect at all tc indicate plur^, 
neither by uirilautirtg the stem vov;el nor adding an ending. 
This class, incidentally, does not include any feminine 
nouns; since the article/adjective endings are the same- 
for feainine and plural, there" woiild be no way to _ 
distinguish the two, if the noun itself did not inflect,; 

•' 1;. No, inflection for noicinat^ve. plural, , but. add, an "-(e) n 
for dative plural. \ ' 

"2: No "inflection for nominative plural, and stem ends in 
"-n" already so no dative plural change either. 

4: Nouns whose only change -for plural is. to add an •• 

umlaut, and whose steirs end in "-n" so there- is. no 
' dative plural indication. 

10:" Nouns using "-(e)s" to form the plural; these nouns 

also do not distinguish dative plural, and use the 
same ending for genitive singular- unless feminine. 

20: Nouns using "-(e)n" to form the plural, . do not i^.^^^^^^ 
distiriguish dative plural. ^ ' i/r ' 

40: [Masculine] nouns forming their plural with "-(e)n'V,;., 
which use that forir also for accusative, dative 
- . . and genitive singular* " - . . 

•Complications;. (not yet programnied): . • ; . 

' Nouns with very irregular plurals; 

Compounds with "-mann" forming plural with either 
/•maenner" or "leute" ; • ^ - 

^ Some nouns formed from adjectives, which use 

ad-jective endings. . 
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Verb£: 

For verbs, a Type cate£ory related to the kind of 
irregularity. ^ . 

Traditional grammars list groups of verbs with 
similar vov/e.l changes, the- seven "ablaut classes" . These 
could be easily included in my program if arj instructor 
desired. However, the emfihasis in teaching seems best 
placed . On tlie individual verb's. form rather than its ablaut 
class, particulg^y since the classes are riddled with 

— exceptions.: so it does. not setem productive to use these 
classes in th6 grsider pro£^ajii» ' . ^ 

Definite inflectional patterns do exist, depending 

. on th^' particular degree of irregularity of the verb. i^Por 
instance, some irregular- verbs indicate third person singular 

• present with a changed stem vowel, others do not. This 

type of pattern, even though not taught exi^licitly, fits . 

'" -precisely the extended notion of sr.anuiiati eel category. 

' ■ ■ -I* 

(1,2,4: no vowel change in present tense.) 

1 : regular verbs " 

2: 'weaJt verbs with irregularities 

4: strong- verbs- 

(10,20,40: vowel change in, 2nd and 3rd person ^inguiar~^ 
present) , 

' 10: 4(/eak irreguliar verbs (especially "haben") . 

20: normal strong verbs 

40: modal auxiXliaries, which use the irregular 

form in the 1st person singular, as v/ell, and have 
. a null ending for both Tst and 3rd person singular 
present (also **^wissenj^) 



Corplications (not yet pro^amicd) i 

"sein" ("to be«') fits no i^attem at all in the 
present ; 

some stems ending in "-st" and "-s" have 

peculiarites in 2nd and 3rd. per eon x^i-esent, 
adding only part of the- "-st" ending, or 
none at all; ■ 

and EQary more. . .. . ^ . . 

Note: . A 

" At this time, only present tense hELS been 
implemented, so the category has only 'teen partly laid out. 
In particular, no distinction yet exists between classes 2 
and 4, as their irre^ulai^ities appear only in the past 
participle and past tense. Other provisior^s will be needed 
to handle certain characteristics of the p£.st particii^le. 

Eva luation of the Type Caterer y : - .. 

,. ' ■ ■ . ■ ' - ■ U 

The patterns described above have eji existence 
independent of the means used to describe ^them. The 
particular method employed for checking inflections v;ill 
determine whether the type category is appropriate. For 
method 1,- the, tabular approach, noun inflectional i:>atterns 
are a very real: perturbation. The table will pfobtibly neea 
to separate entries for the noun -type as given. Method 2, 
the distance metric, assigned a value to type^errors. It 
proved difficult, however, to work the desired decree of 
precision into the weighting scheme; eiophasis on proper 
• -eSrpe seems to vary from slot to slot, ^ perhaps dependin^c on 
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whether the class requires a positive inflectional 
indication or simply the lack of one. 

Por method 3, probabilities, "I have not, yet done 
". the necessary- analysis'. 1 believe that ispecific 
^ probabilities can be introdu&d to get the kind of 

we i^ ting 1 needed but c'diald not get with E:ethod 2. For ■ 
verb inflection checking,', the typing syster. have been r.ost 
useful. It allows a facile dis^oels of situations suciri as 
the Qor juration of a verb on a pattern which would be 
correct if the verb were [were not] irregLilar. 

■ • - ■ . . . V 

Multiple-Word Inflectional Patterns 

Quite frequently, the inflectional, check consists 
not just of one actual forin versus the expected 
characteristic, but of several forras all -showing some 
inflection. Por example, even a simple noun phrase will 
have inflection in 'both the noun and the article. ' The 
student's intended answer plays a strong role in such : 
circumstances, as it finds expression in not just one but 
several imflect ions- 

Sometimes one word governs the other, as 
sub ject-velrb ; other times, both lake their specification^^ 
from somewhere else, as case and number in- a noun phrase, 
in either situation, the. common soLirce (step 2 in the rule 
' •* ■. ■ - ' - , . . ' ■: ■ 

*■ , • *( * - ■ ■ ' - « ■ , 

; • ■ x , ■ . . . ' 

■\ " O ' ' ' ' . . .. 
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framework) should mean a coimnon value for the indicated 
category, even if it is actually a vrong vclue. The - 
student deserves credit for correct j)erf orr.ance under the 
"internal agreement" aspe^ct of step 2. 

A three-way comparison may well be indicated. The 
' incividual words must "be checked against each other, and 
the result of that comparision checked against the ; 
expectation. If there is any disa^eeinent the expectation 
must be checked against -sach word individually (Perhaps 
also necessary to investigate, possible weal-^-correct 
situations). One approach to the icultiple-word problem 
lies in the pairwise aI)plication of the sare comparison 
technique a:s for a single form, thereby reducing the ^problem 
to a mana^^eable form."- Per tv/o words, say an article ajsd* a 
noun, several distinct .possibilities exist: 

IT all three agree fine, correct; 

2, the inflectional characteristics associated with the 

actual forms agree with each other, but not with 
the values of the expected slot- — a fairly clear 
instance of "internal a^eeraent" on an incorrect 
value ; - " 

3. the charactQjyfstics associated with the actual forms 

do not- agree with each othei-, hov^/ever one of them 
■: ■' agrees with the- expected value — The latter form 
is correct, the other, wrong, and there is no 
internal agreement. ^ ' • -.. 

- Unfortunately, because of the multiple 
characteristics associated with a given form', the following 
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confusinr -ituation can arise (using a single letter to 

represent an a? tetrary slot): 

Expeicrted :slot : a . • 

.Wore: 1 - sctuali characteristics: X, I (two slots) .. 

Wore 2 - actueii characteristics: Y 

In this instsirjce* ther is internal a^preement for 
characteristics *" I", wE^h is not in agreement with the 
expected value, fit the same time, the expected value 
matches^up we2-l .ssainst th^ "X" characteristic of word 1, a 
characteristic ncc shax-ed by word 2; a situation which 
could be interpreted as a form error in word 2. To take a 
concrete example, - 

. Expect: "sie trinken" ("they drink'*) , 

Slot: <3 p *> (3rd person plural. 

, any gender) 

Actual: "sie trinkt" ("she driniib-'?) 

- ■ ^ ' , ■ i 

Pprms used, and associated characteristics: 

"sie" — <3 s f>, <3 P *> 
"-t" — <3 s *> 

Diagnostic analysis: 

Expected - subject agreement on <3 P *> 
' ■ Subject - verb ( interhal) ' agreement qn <3 s f > 
Overall: no overall agreement 

The dia^ostic picture becomes even more cloudy v/hen 
the comparisons between slots are not exact, but must 
rather be referred to something like,, the closeness metric 
to determine which is less wrong. The. symbolic example 



nii£^t then have ,X sjid X', I and Y', ano a .iud£3r;er.t v;ould 
be" needed about how. X lEatched l', cojTipr d v/ith hov; ^ 
matched' Y', compared v;ith how X matched X , cor;:pared with 
how X' matched Y: instant inssjnity. . . „ 

SoF.e relief is~ afforded- by the specifics of German, 
in that an. PiJ-tiple-inflection situation usually is quite 
de£;enera;te in temii of what cate£;ories are definitely 
indicated by which v/ords. - Most often, the inflectional ' 
forms coinpleir.ent one another, adding precision to v;hat 
would otherwise be ajnbi£:uous. Thus the "die" 
fen inine/pluraJ. confusion is reg-olved by the noun ending, 
the noion shov/s nothing of gender in its ending;, and the 
aiiibiguity of nominative/accusative^ for "die" it- not resolved 
in either word's inflection. The number of outright ' 
conflicts is thus" less than ni,i£ht be irasined at an initial, 
theoretical investifjation. 

; Whjsn Method 2 (distahce metric) v/as being used for 
article ; inflection, I put it to work on th& .coirbined 
inflection problem, too. It was abandoned' because' it ;iust 
didn't seem to reflect the rigiit balance between what one 
word showed alone and what effect it had on" the other word 
(and also because of the difficulties of size and complexity 
mentioned above). For Method 1 (tabular) to be^ used in 
his context, some more information would go into the t^^ble, 
or the tabular results for. the two words might be combined - 
by sSme other algorithm, as yet unborn. I shall be happy 
'if Method 3 (probabilities) allows the necessary interaction 
to . be expressed in a.« cleaai - formalism. 
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Sii:ib.iect--Verb Inflection Checkinr: 



Implementation of the verb- sub ^ject check has been - 

reasonably satisfyineV . It provides a r.odest illustration / 

of a number of the important ideas of the j^^adin^ program, 

thq.u£;h circumstances have combined to make the demonstration 

far from complete in many instances. I have, so far, only^ . 

implemented analysis of the present tense. In that tense, 

verbs have only five fcrms, showing only tv;o basic ^ 

erarauatical categories: person and nunber. ' If the subject 
■ * 

- of the sentence is a - noun or^ noun phrase, the available 
. alternatives are further narrowed, as only 3rd person is 
then relevant^ Pronoun subjects range over all values of 
person and number; but long before the verb analysis is 
begun, the pronoun search logic takes care of deciding which 
slots are actually^ pertinent. . 

. - The rule governing verb inflection demands 
agreement v/ith the subject and the program .has always 
concentrated on just that demand. If the subject is not 
what was expected, I still base the ending check on what is 
actually present. In terms of the general inflectional 
theory, the checking is defe9tive, as it lacks a comparison 
with the expected value. Because so very few forms are 
involved (and one of them is identical to the infinitive). 



. there didn't seem any point in going to a more involved 
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calculation. In several ways, however, a check with the 
expectation does enter- into the deliberations: 
^ 1, the determination of subject characteristics is done 
in consultation with 'what should occur; 

2. those inflectional ^characteristics which depend on 
the verb type (regular verbs, irregular, etc.) are checked 
through a comparison with the correct type for that verb; 

3. as the scope of the implementation is expanded, 
future versions will have to consider the expected tense 
and mood, particularly when dealing with irt-egiilar verbs. 

I am intrigued by the two-way^checking which 
sometimes occurs between subject and verb ending. The 
primary check is direct from subject to verb: find the 
subject, figure out what its grammatical characteristics 
are, and then check if the verb ending agrees * iiov/ever, 
"there may be some ambiguity in the grammatical attributes 
of the subject; the verb ending may be able to shed some 
light on the confusion. In particiaar, the verb inflection 
shows a very' clear distinction between singular and -plural; 
nouns and pronouns may be ambiguoijis in their- indication of 
niomber. The pronotan search already contains logic to. look 
at the verb ending when a subject pronoun is involved. The 
noun phrase logic*^ does not yet make a similar check, but 
should. The two-way check is not without its. problems, not 
the least of which are the 2nd person familar forms. But 



even in limited implementation, the verh two-way check 
provides a nice example of how an examination of both parts 
of a related inflectional scheme can aid the determination 
of just what the student had in mind. 

To handle irregular verbs, it was necessary to 
expand the verb inflection logic to include a consideration 
of the stem as well as the ending, and to add another 
grammatical category to reflect the type of visrb. (See 
above for a complete* discussion of the "type^' category.) A 
modest multiple-term inflectional situation thereby exists: 
subject, stem and ending* My basic strategy is to attempt 
to form a composite frcam the stem and ending which can 
represent the whole verb, and then compare tliat composite 
with the subject. To properly diagnose errors, individual 
comparisons of subject with stem or ending are often 
required- 

The ex&ct nature of the comparison varies slightly 
from one stage to the next. Throughout, the very simplest 
comparison scheme is employed: values match or they do not, 
with no half-correct measures considered. Of course, a 
particular ihflectional form for stem or ending will have 
severe character slots associated with it. In forming the 
subject-end composite, the program evaluates all possible 
combinations between the two sets of slots. Only those in 
whicb jhe values for both person and, number match exactly 



are retained. Each slot includes the specification of one 
or more verb types: the intersection- of the verb types is 
formed for the composite terms. In comparisons involving 
the subject, no verb type is relevant and the program only 
need check the exact match of person and number. 

By checking: subject against 'the stem/ending 
composite for an exact, match, the program can tell directly 

whether the ini lection is proper. If there is an error in 

- ■ ■ ». 

the stem, the subject/stem .comparison v;ill bring that out; 
likewise for ending error. Another possible diagnosis is 
that of verb type error. The program must check the verb 
type- of the final comparison "tprm against the correct type 
for the verb. If -there is r>o match, then the inflectional 
f orm v/ould be correct except that the student has not 
properly observed changes due to verb type. Other type 
errors may be localized in the. stem or ending, and- can be 
easily -spotted by a similar comparison of types.. 

A full dejscription of the verb inflection check, 
along with several examples of actual prograor output, can 
be found in Appendix D. 
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Chapter Eight 
WORD ORDER ANALYSIS 

Introduction and' Definition : 

A Aajor grajmnatical phenomenon, that l-crown <as 

"word order", is based on the order (sequence) of words in 

■ ' . \ 

the sentence. In my work, I use an extended notion which 
also covers the order of phrases and elements, and includ'^t; 
the determination of the basic clause tj^DC. like 
inflection, v/ord order -as a'-grannnatical phenomenon appears 
in differing roles: in the identification of the individual 

■ . ^ ^ 

words of the' student's sentence; in deciding wliat the 
assembled element or sentence Kecins; and ai number of . 
grammatical rules which, in the instructional setting; must 
be checked for correctness by the grading pro£:ram. 

. The general analysis used in" this j)roject considers 
the German langi;iage. a6 having three grammaMcal levels: 
^ words, clause elements, and clauses. Thi^ tripartite 
division is of particular^ pertinence to word order, whose 
role centers .around the two- transitions: word to f ■ ' 

clause eleiiient, and clause eleEent to clause. ^ . ' 




From Words to Clause Element 

A clauso element is tuilt out of words, basically^, 
with perhaps a little internal struc taring. The meaning 
and grammatical function of the element derives mainly from, 
the choice of word:., along with inflectional indicators. 
V/ord order within the element' is not very interesting. The 

German language allows little or^ no flexibility of order; 

( < 
the rules are thrre to be followed, serving perhaps a \ 

useful function in allcwing one to group together the 

consituent words of the element without being confused by 

neighboring ones* Accordingly, .1 use word order within the,. 

element as a major guide in the identification process, 

looking for modifier and functional words where they should 

be located relative to the head word of the phrase (see 

Chapter 5).. Particularly Tor the simple structures which 

the program can accept at this time, I consider 

within-the-clause V\ford order errors unlikely, and provide 

no diagnostic facilities for them. - » 

For ref erenc6^ the current 'multi-word element / 

capability is as follows: 

(noun phrase) : article + noun ' 

subject or object : noun phrase [ article + noun ] 

:* preposition + noun or pronoufi 

: preposition + noun plirase 
. . . . / ,[ article + noun ]^ 



In the near future, a certain amount of flexitility 
will be reouired due to a planned expansion in the 
coEplexity of noun phrases. For instance, I will include 
phrases with two nouns, as in "eine Tasse Kaffee" ("a cup 
of coffee")- The search i:outines already have the needed 
flexibility;' what is lacking is the error checking and 
interpretation. . 

' rroDi Clause Element to Clause 

A clause is constructed out of clause elements. By 
careful desig^i, the word order, analysis of a clause is 
deferred until all the constituent words have been grouped 
together ijfto elements. As remarked above (Chapter 3), word 
order (at the element level) is not used in searching ' for 
elements, neither in identifying them nor in determining 
grannnatical function. This follows from my general 
philosophy concernirig errors, that nothing which the 
"is^tudent might do wrong should play- an important role in - 
identifying his response; also from" the inherent "flexiblity 
of the German, language regarding the 'ordering of elements, 
which tends to preclude any prediction as to wliat element 
will come next (particularly arcong sentence field elements, 
where help is most, needed) . , 

. Two Tasks ; The task of the word order portion of 
the pro^jram is twofold. The program must check the order. 
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of the clause elements for conformity with rules of grammar 
and with the. shade of meaning ^'ncorporated in the expected 
sentence. Furthermore, the^program must determine, by 
analysis of the actual student responsis, what type of ' clause 
has been created (main or subordinate; statement or 
question) , and give an interesting diagnostic report on 
-"its findings. The clause type determination is done first, 
as some parameters of elemeni ordering depend on the tyipe 
of clause. . 

Cls:use Type Determination: 

"l^he clause type determination subscribes, in 
theory, to the three-step model: cue, intent, actual 
response. However lihere are only a few forms, far less 
than for inflection. Also,' the nature of the verb 
placement rules is such as to leave little ambiguity as to 
what was intended. Most likely, the student will err 
either in a' wrong intent or in partial distortion of the 
execution step; in either case, his> intent will show quite 

clearly. . 

The first step, then, is to determine what he seems 
^to have done. Then, if ;bhatis. not correct, considerations 
of motivation will be in order. Errors in execution may 
confuse thfs initial determination. It may even become 
clear that the student does not know the applicable German 
word-order riiles at all, in which. case a diagnosis bf . 
"confused" will be returned. - V 
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For reference, here are ,the clause types currently 

included^'in the program's German sranmar competence. For 

each type, i list the identifying characteristics: . 

■ - . . ■ • ■ 

TYPE 'characteristics 

Statement rV-2: Finite verb in second position, 

i.e. at front of clause preceededrby _ - 
exactly one clause-eleir.ent . 

Information Question ("Who is that?") 

FV-2: same verb position as statemeni:. 
Initial element will be the questipn word. 

Yes-no question ( "Are you coming?" ) " ^ 

^' ■ V . FV-1 : Finite vgrb is at the very front 
of the clause. 

Dependent ■ clause . ^ a 

rV-L: Finite verb is Last, at the end 
of the clause, following; even a dependent 
' infinitive or past participle. Th^ initial 

element will. beca relative or subordmatmc 
word. - ■ ' 



(The "finite verb" is the inflected verb, as distinguished 
frc4 an infinitive or past participle. The "FV-n" notation 

• ' refersto the finite verb position, and is borrowed from 

Kufner <ref. 7.2>.) ' 

As /hown in the chart, verb position and 
introductory words are the main factors distinsuishing 

• different clause types. There are several other 
differences: the dependent clatise FV-L configuration is the 
only one in which an infinitive can precede the finite 
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* Or occasionally it may come second; see <ref. 9-3 ; 
Reference Grammar, B7'86>. 



verb; a separable prefix will not be detached from a finite 
verb in FV-L. The program also relies on its knowledge of 
the expected type, particularly: - 

1. when an error regiiLtr. in conflicting indications (i.e. 
an initial relative wprd, but no FV-L )i 

2. when a clause has so few elements that a clear 
determination is hard to make; 

3. to decide between command and yesr-no question (not yet 

implemented). 

Type Determination Al/rorithm : . The' clause type 
determination algorithm begins/,by bjeeaking the nonverbal 
elements of the clause into three groups: those -before the 
verb, those between the two verbs, and those foil ov/ing the 
second verb. (If there is. only one verb, the last category 
is null.) It can then make the following pattern telsts: 

( *'FV" denotes Finite Verb,. "DI" Dependent Infinitive) 
' ^ no elements "before FV means FV-1 format; 

— no elements 'after FV probably means FV-L format, 

particularly in DI precedes it and/or. ;there 
is "a Rel. Word present. ' However, the simple 
form <£ubject + FV> Is FV-2, a statement; 

— one element iSefore FV is a statement, FV-2. 

' ' ' ^ . ■ • ■ . ' . ■ 

To handle erroneous format ipns-, the actual patterns 
are somewhat broader than the above, to wit: 

— DI prec ceding FV probably indicates a . dependent clause 

(FV-L); confidence diminished if no Rel. Word, or 
. if FV position is too far from end of clause; 
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— FV last position can indicate a dependent clause even 

if the explicit Relative Word is missing; 

— FV last position can indicate a dependent clause even 

if the initial v;ord. is a Question Word (unless 
• there is nothing else . "before the verb); - 

■ — FV in the middle of the phrase is tefitatively 

' identified as FV-2, particularly if a DI is present 
and follows the FV. 



Vai'ious grammatical -checks must be made: 

— for FV-I &: FV-2, if a DI is present it must come at the 

very end of the clause; , DI follov/ihg immediately 
• . af ter FV is taken as a specific mistake, perhaps an 
indication of ah English pattern; 

— for "FV-rL, DI ('if present) must be at the end of the 

clause but prec ceding FV; 

— for FV-2, FV must really be the second lement; 

— ■ separable prefixes associated with DI must always be 
attached ; 

— separable prefixes associat6d"'with FV rjust be detached 
except- for FV'-L. 



The tyjDC determination is accoEipanied by a level of 
confidence, evolved dynamically during the course "of the 
analysis. The initial guess /.ill usually have "high" or 
"veryhigh" confidence. Errors will cause tiie program to 
successively lose confidence in its diagnosis; if a level 
"none" is reached, the determination will cease with a 
value ••confused",. Likewise v certain observations may boost 
the confidence level; correct performance in placing a- 
separable prefix is a notable example. . 
. . •.. A full description of the ^goritlim is given in 
^ 'iippfendix E, followed by examples of its operation on more 
EBj£ bhan a dozen sentences in al)pendix F. , . 



Diagnostic Interpretation : Interpretation of the 
clause- type findings requires additional analysis. The 
three-step model will be most useful: cue (expectation), 
intent, and execution (actual form). 

• The student must make two decisions in deciding on 
a clause type: first, wheth'er it will be a main or 
subordinate clause; seconjg, if a main clause, whether a 
statement or question form is required. I shall first 
consider the determination of type for a main clause. 

The decision^a£ to statement or question is based 
primarily on semantic criteria, what the sentence is to 
fsay. An error,-, then, is not .so much a violation of a 
grammatical rule as it is the production of a different 
sentence. I do not corisider such inter chaiige error very 
likely, other tha^ due to the student misunderstanding 
instructions. I find it liard to say whether an error would 
be due to improper iniient or faulty execution, as the 
required execution is fairly straightforward and natural 
for an English-speaking students 



Specifically: ■_ — ; . 

1 . statement becont PV-t les-no question : probably ^ ^ 
intentional (no Qi^ beginning a statement with 

■ a verb) ; ; ■ ■„ ■ 

2. statement becomes FV-2 Information question, or 
"^vicer-versa: probably intentional, as the use and meaning 

of the introductory question word is quite clearf; 

3. FV-1 Yes-no question = becomes PV-2 statement: 
Possibly intent wrong, or possibly execution confusion on 
an English pattern ( "Today is ^yone lonely?" ) . 



The choice between main and subordinate clause is 
deiermined on a purely granmiatical basii^^pom the structure 
of tiia sentence: one main clause, all others subordinate 

the usual exceptions that prove the rule, particularly 
(mediation). I ant icipete many errors just at this point, 
if only because En£;lish ailowis one to use subordinate 
clauses without bein^ aware of so doin£ and without the • 
subordinating word. The clause type determination takes , 
into account that the student may use some of the trappings 
of a dependent clause, particularly, the subordinating word, 
without realizing (anci thus without intendin^s) the 
subordination. Or, conversely, he may consciously produce 
a- dependent clause- without a subordinating v/ord. 

There is no waj^ to be cei:tain whether ' the error is 
in intent cor execution, though some guesses can be pyt 
forward oh the basis of available data. For demonstation 
purposes, I have set up a decision network to handle the 
guessing. It is guided primarily by 'the ti/o main marke^rs 
of dependent clauses, the subordinating word and the finite 
yerb-"3ast posltlonw The conclusions are as follows: • 



issrfbnnance 

Sub^rH^Gird 
pres^ree 

ok ok 

ok no 

missing ok . 

missing no 



diagnostic interpretation 
intent • Executitm . 



correct 



correct 



maybe — or — maybe 

(either One Tight, .other wrong) 

prob, ok errors missing word 

wrong must be referred to 

what Was done instead 
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' -The above chart, which has been implemen.teA, could 

• be augmented by information from the placement of .dependent 
infinitives an s separable prefixes.! At this, time, they are. 
used the basic type deteriaination but not in this secondary 
analysis. 

pheckin^. of Clause-Level Word Order ; 

Having decided on which type of claus6 is present, 
the analyzer "is ready to deal v;ith' the eleEent word order 
of that clause. All pre;^i^^ary w is out of the way: 
the claiise limits have bee|a<iefined; all the constituent 
elements have been, formed from the ind.ividual words and 

"" analyzed; any informati 
perfornance, such as the determination' of-clause type,^ha^ 

■ been done; and, of course, ther^ is the information on what 

is e^jpected.' 

• . . Clause" word order is governed by various kinds o£ 
- rules: some, like the verb placem^ 

above, are ; very strict, closely presc^^ 
primarily grammatical import- Other positional decisiohs 
can be made more flexibly, with word order determined, by 
the desired meaning or influenced, "by the particular; • 

emphasis desired. 

Meaning determinations are connected with the need 

to establish' ^am^^ 

example, the distinction between subject and object is 
sometimes made on purely positional grounds. Or meaning may 



figure into -word. order because of the varylne modification 
involved in different positions. The negative adverb 
•'nicht", for example, has a different interpretation, 
.depending on where in the clause it 'Dccurs. Emphatic 
shifts occur as certain elements are placed in different 
order. This applies especially to the choice of the initial 
element, but also to other ordering considerations. (This 
kind of emphasis is familiar in Englisli mainly as stressed 
words when speaking: "Kans will buy 1±ie book", versus "Hans 
Will buy -the book"; in German, word order is a coimnpn 
factory as intj^the- less coramon Engli^ example "Over the 
house the little airplane flew.") 

The various word order rule checks are best 
described one category at a time. , 

Verb PlacemenTt : Verb, posi-bion is very closely 
prescribed. The dependent itif in it ive "(or im-ticiple) will 
ustiaily be at the end of the clause^, followed only by the 
finite verb if a. relative clause, and an end field if there 
is one.* T&e finite verb must be eiiither first, second or 
Idst depending on the- clause type. ((The program vyrill 
accept a statement, " i.e.", itfiim a dependent infinitive 

in .thei^-frcsnt fieldl.) Verb position is not only a rigidly 
specified item, it is central to the whole element word : 
order of the clause. The program accepts certain 

* I am dealing with a simplified grammar, - of course. 



distortibns of the correct pattern as reasonable mistakes. 
If the student's performance is worse than these limits, I 
merely assume that he does not know German word order at 
al3., . The following account of the program's specific case 
handling does not include a category of "elsewhere"\^^^A^^^ 
verb position other than those listed will usually cause 
either a failure of the clause type determination, or a 
^verdict of "no confidence" in that determination; in either 
event, further word order checking will be discontinued. , 
(The following Algorithm is part of the PHRASEDIVIDER / 
PHRASETYI5KES>©BT complex; see appendix E for full 
description.) 




Finrfce very position for a statement: 

position 1 not recognized as statement 
position 2 - — - correct 

iposition 3 — incorrect - "front field error" 

but tolerated, in* the presence of , 
this error, no checking is done on 
the front field (see below). 

Pinxte; verb position for a dependent clause: 

- at very end — ^ correct 

followed one element — — correct if that element 
,j, . can be an end field; otherwise, 
clause "may. not be reco^^iized as,- 
' > dependent. if so recognized, ' 

tdler?ited as incorrect, and offending 
worci .identified gi^ an illegal end field 



Recognized only if a dependent infinitive preceeds or if 
Q there' is relative word and some other element before the verb. 
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. / ^"^^ - 

* i - -J . , ■ ' ■ - 

-i • 

Dependent- infinitive position in main clause: . ■ 

at very end — ^ c<?rrect 

followed by one element correct if that element 

' can "be an end field; otherwise, 

incorrect but tolerated. The offending 
element is identified as an illegal 
end field. 

neither, but imroedlately following finite verb 

incorrect - "possible Iii£;lish pattern". - 
ikmt tolerated. (Conmaxe En^isfe "I can" 
ibi^f a new car today-.'" ) 

,Dependent infinitive position. in a dependent clause:- 

^Vollowing finite verb - — - incorrect — "finite verb must 
\^ EbiLlow infinitive im dependent clause" 

immediately preceeding finite verb correct 

elsewhere — - iaacorrect - "rasLdcK^ positioning" 

° ^ x^l; tolerated bust if fiinite verb 

. ■ ■ , ., "innsition is okay pa-ocisssine; "w£ll 

../. -csimtintie, igntscinE ifche positiGn of 

; " r . ttiSe infinitive. - 



Sentence I'ield : The "sentence field" elements 
include the subject, object's, mbdifiers^ and many ■ predicate 
complements; in short, jaast about everything- except the 
-verbs themselves. . In a jstatement, exactly cme of the 
sentence field elements is placed in front of the finite 
verb, in the "front field". .. , ' 

A l^rief presentation of the relevant definitions 

aiid rules of „ German gi-ammar is available ii^ appendix A. 
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Clause word order is inainly a question of the 
ordering of the sentence field elements and the selectl.c!Ti 
of one element for. the front field. As the front field 
element, if it even exists, is simply removed from the 
sentence field, the program is designed to handle word 
order '■'for the :f ull sentence field. If an- element -is 
removed to the : front field, its absence is easily notsi; jLf 
the student seilLects' ^ different front field element tha.13 
the expected one, the word order analysis c^Sji still 
directly. - ' 

" y Word order within the sentence field tends itos Ik 
spec if ied in rather va^ue terms., - e:ven by the Refer esice 
Grajsunar; various rules are offered to the students, six? 
certain relxance is placed on what sourids ri£;ht • Tte 
program caia he given the ex^ctfed word order for the 
sentence and compare that against what it receives fr^^ 
the student.' Wot only woiild I like- the, program to be 
more self-suf f i cien t ; a. grtiater internal knowledge ia 
essential to he able to produce reasonable error mes^^^^^^^^' 

- The most important ordering rule is tha,t call«L 
*'news value* ". When this, is invoked: over other CQnyentjpffim^^ 
the program is totally at the merest of the expec4(ation 
inf ormation^ The other conventiipris available to guide the 
Jprogrami ar^ assembled in a "normal word order" w Where , 
ppssible, I have had the program check adlierence to the 
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normal word order rioles: *• . ? 

1. pronoun elements (subject or ob3ect)"must preceed noun 
elements in the inner field. They cannot be shifted to the 
end of that field for added stress; 

2. pronoun subject must preceed a pronoun object; 

3. noun-phrase object with an indefinite article follows 
one with a definite article. . (Not yet implemented; given 
in oext only through examples,, not as explicit rule). 

- , ■ , ) . ■ .■ , 

I am not certain how best to approach the rest of 
■** ' • " ■ ' . , ■ . 

the checking. The following considerations apply: • 

■ 1, A way is needed to compare two orderings, such 
as the expecte^d with the student's. Simply counting \ the 
number of permutations is 'prjobably not " optimal from'' ai^^ . 
educational viewpoint; although some errors are simpl^^.due 
to confusion, others are doubtless mor^ systematic and \^ 
should be so explained. I have done some preliminary i 
investi^tion into an appropriate algorithm, but not enough 
to report here. ■ Y 
2* Sincie there does exist a normal word order, ! 
the student's performance will have to be compared with j , 
that as well as with the expectation in order )to catch 
errors of omission as well as comission. ssThat is, the 
expected may differ ffom^ normal in- one. way, but the 
student produces a sentence without tiiat deviation , rather 
some other. Wi thout ref erence ~to the norm:, the program 
would be forced to diagnose an interchange of elements, 
rather than a "you did this instead tof that" report. 'Por 
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-example, let normal be "A B C D E",. expected be "A B D E 
C" and the student's clause stand "A C D E B", assumine (as 
is the case) that there is an applicable rule governing the 
moving of an element to the end. The student, clearly, 
moved "E" irtstead of "C", but a pure comparison of his with 
the expectation will say that ''B" and "C" are exchanged. 

/ Sentence field word order may have direct 
grammatical role under some circumstances. Ordinarily, 
inflectional changes are sufficient to distinguish between 
subject and object; or the semantics are'=" sufficiently 
xmsmhlgaous ^ buy people). If neither 

indication suffices, then the sentence must adhere to 
normal word: order in order to be unaiibiguous; that is, the^ 
subject preceding the object. (This check Is not yet 
implemented.) - 

Tront field Choice : If the clause is to be a 
statement, it must have one element preceding the finite 
verb, in the "front field'-'' slot. This element is taken 
from the sent ence.fi eld, as noted above. If the front . 
field contains more than one islement, the program will 
disclare a verb position error , as noted in the chart above . 
Since the student - is obviousiy confused .about the def intion 
of the front field in that case, no analysis is, done on its 
constituents. , -If .the front, field is correctly formed, the 
program miast decide by reference to the expected sentence 
whether the correct eleinent has-been chosen,^ The choice is 



- •149 - 

purely semantic, so there is little the program can say 
other than to~ report correct or incorrect, (The size check 
is currently implemented, the choice of element not* ) - 

Front Field or End ..Field Elements : If- a clause 



element consists of a dependent clause, it %ust he placed- 
in either the. front or end' field. (The "end field" follows 
everything, even dependent infini The program can 

easily check this rule, as well *as referring to the 
expectation information for what mi£::ht have been done. 
(Placement check is implemented; reference to what was 
expected, not yet*) 

" ■ .... • - , \ ' ' ' . ' - ■ ' 

Separable Prefix : The separable prefix which 
characterizes many German verbs- falls under rather definite 
positioning rules: itMs separated from a finite verb in a 

. . ' - ■ . 

FV-~1 or PV-2 clause, otherwise it must be attached. The. 
program currently goes to a lot of trouble on account pf 
these^pref ix^s, partly because it checks the disposition of 
^ the^ pref ix as part of the clause type determination, even 
though it is not Liitil that determination has ^been made^% 
that the prefix check jean/ be done properly." If the prefix' 
location is inconsistent with' the clause type as , decided, 
the confidence level in the tj^e determination is reduced*. ^ 

In reporting the prefix^ position, the program 
distinguishes infinitives, for which the. prefix must - 
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never separate, from finite verbs. Pour categories of 
placement are detected: attached; detached but imined lately 
preceding the verb, which is treated as attached with an 
orthographic problem; detached and at the end of the 
clause; and detached in some random location. The attached 
and detached- (proper location) cases are then compared with 
what is proper and messages are generated accordingly. 

The detached- (random position) case should receive 
more attention, as it may acttially indicate a different 

confusiono The separable prefixes are homographic with 

' - ' -1 ' ■ ■ " ' ■ 

prepositions. . Since many verbs taice prepositional 

objects, the student may indeed have a valid confusion. . 

'■ . , , ' ' . . _ < ' 

\ * N ■■■ " ■ ' * ' 

Introductory- Words : Question v/ords and 
subordinating words, must b6 "found at "the very beginning of 
.the clause. There is a possibility pf confusion froim an 
En^ish pattenn which allows a preceding adverb: 

"Whb's on first now?" ' . 

"Now what'H' we do?" \ ' • " . 

as well as the less common, contrastive: ' 

"He went where?" " 
The program takes sdme cognizance of these eri^or ' • . 

possibilities ; it can analyze a question clause even though 

; , ; ■ . ' ' . ■■■ , ' , ■■■ ■ " ■ ; ■■; ' \ ■ -y 

the question word is not; first, and complain later about ^ 

the position error. , \ ^ 



"Hicht" (not yet implemented): ' The adverb of 

■t 

negation, "nicht", presents. some- special positional 
prqbieins. Placed at the end of the inne^ fi^ld, it ne^tes 

r 

the whole clause; placed elsewhere, it negates only one 
element or conveys a different emphasis. The program will 
have to check for. legal positions as well as explain any 
variation from the expected position. 

Negative sentences piose another, related problem, 
particularly when only one element is to be negjated. The 
student may. use a negative article "kein-" instead of 
"nicht"; or, more pertinent to word ordering, he may place 
the element to be negated in the front field, complicating 
the positioning problems for "nicht". 

Particles; German lias a class of , words Icnown as , 
particles or intensif iers. These do not count as clause 
elements, and can appear almost anywhere between elements. 
A simple check with the expected sentence is about all that 
can be done here, (The current implementation does not 
check what was expected. ) 
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Chapter Nine 
• GONCLUSIOK . • 

This dissertation originated in a desire to fashion 
the computer into a tool for performing sophisticated arswer 
analysis, i.e., grading of student responses. I have 
provided a wor-king program which, while still modest,, does 
represent a successful demonstration of my principal ideas. 

In the course of the research, I have developed 
algorithms to perform many different tasks. Some of these 
algorithms are noteworthy in their own right. Of equal 
importance from a technical, viewpoint, I believe, is the 
assemblage of many routines into an integrated whole. The 
interrelationship^ of the, various parts allows the progi-am 
as- a whole to surpass its individual components and, in so 
doing, provide an indication of the more refined performance 
which will be available simply by upgrading the various 
constituent routines. 

• As another achievement, I claim a good beginning of 

a sound theoretical base for the analysis and explication 
of errors. My theory provides guidelines pertaining Bo^Ii 
to the process Qf pinpointing an error and identifying its 
probable cause. 

In all honesty, several aspects of the current 
implementation represent but a modest success, for one 
thing, the program can presently accept only a very limited 



range of granmar. However, the limitation is due largely 
to considerations of time and effort on the part of the 
researcher; I see no inaj or obstacle in the path of 
increasing the grammatical complexity significantly while 
•retaining the program's present, structure, 

Ihe program lacks a formal grammar, talcing 
assertions about the state of "fcompleteness of ^ the 
grammatical checking very difficult* ' Yet I would venture 
that a great many of the . interesting features of natural 
language may be most profitably considered as a collection 
of special cases; if this be so, a flexible approach, 
capable of easy accomodation of any special case checking, 
IB indised approp^^iate.' The issue of completeness can be ' 
settled by the Reference Grammar, by simply insuring tliat 
checks have bee^ programmed for all the appropriate rules. 

A serious shortcoming lies in the' limitation of the 
program to a single, . specified sentence. This difficulty , : 
is rooted in the basic design of the analyzer; even so, 
there are several ways in which the' program can be improved 
to soften the impact of the limitation'. For specific 
educational applications, "the. limitation is not always a. 
•terribly unnatural one. In sum, ^ this difficulty represents 
but one^of fflarF3r''°aspects of the program; it is not an ^ 
overriding failing, but must.be weighed in^the balance 
with the other positive and negative features. 



The text of this dissertation has' detailed many 
positive contributions. I misht list a few here in summary 
the ability to analyze entire sentences, i.e. , complex 
structured responses; the production of detailed error 
reports; the ability to cope with multiple independent 
errors; the assignment of proper credit to each aspect of 
performance even in the presence of related errors. Most 
important, I have aimed at construction of a £prader which 
would not be a liability to an instructional program, but 
rather would be able to lead the way to more imaginative 
teaching through its ability to gather large quantities of 
information on what the stjudent has done. I am 
particularly proud of ^ the program's ability to 
simultaneously follow, diverse aspects of the student's 
general performance • ^ 
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Chapter Ten 
. RECOMMENDATIOl^S FOR PIJTURE WORK 

There is no shortage of . possibilities for future, 
development of this project. The capabilities of the 
analyzer can clearly be increat-ed markedly on a number of 
fronts. The education^ promise of the grader can be 
realized by constructing an instructional program to 
utili-ze the information extracted by the grader. And on a, 
broader -scale , man^ of the general, pri.ncipies I have 
elucidated, as well as some of the specific techniques, 
could be applied to Computer-Assisted Instruction in other 
subject 'areas. 

- Rytensi ons of the Existin; ^ A l^Qr i tt-ims 

:The program has many loose ends. Some, like the 
.restriction to present tense verbs, are the result of 
simple constraints of time and effort in any particular 
area. In other instances, .such as word-order checking, the 
iiieomplete state of the grammatical analysis also reflects 
By need to have something to. show for the overall 
demoiistration of the grader . In the latter case , upgrading 
the program's capabilities will probably entail some local 

rebuilding.. 
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Overall, the program is in fairly good shape 
concerning extendahility, as some' oA<^e more recent 
additions testify. The addition of facilities to handle 
preposltibns, for instance, was quite straightforward and 
simple. ' A characteristic fimction "was created -to determine 
a word's raembership in the set of prepositions, implemented 
trivially using the LISP property-list feature. A simple 
function, rmodeled on those. used for noun phrases, 
established a structural context for the analysis of the 
prepositional" phrai^e ; the irechanics of scanning back from 
the phrase to clain: the preposition was already contained 
within the general executive routines. 

In ^mother instance, I once decided to have the 
program automatically adapt to the paraphrase of a noun 
phrase by an equivalent personal pronoum*. At the point 
where the program noted the absence of the head noun, all 
of the information was aljready available to specify the . 
pronoun, relying on the prograxn/s internal knowledge to 
determine the orthographic form as well* There, was no 
difficulty fabricating the neicessary search command, by 
duplicating -the equivalent Phase One activity; the 
mechanism for carrying out the search is precisely the 
routine already used for pronouns*^ ' 



* 33ie paraphrase feat^^ was removed during a subsequent 
reorganiziation^ It is easily reintroduced as a special^ 
case,*, but I have preferred to-delay until I could make it a 
part of a more comprehensive check on "e^xtra anci missing 
Q words # • 
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Here are some of the ways in which the program might 
be improved • - 

Additions to. the Grammatical Capabilities: 

There is no limit close at hand on the expansion of 
the program's grammatica], routines, even restricting 
consideration to the needs of a first or second year 
language course. The improvements suggested here represent 
those that seem the most immediate candidates at this time. 

Verb tense expansion, especially to past and perfect 
tense: This requires that the program Jnave the ability to 
distinguish the conjugatiorvOl patterns of verbs in the past 
tense,, not too great an extension from the current state. 
The formation of past participles is more complex, due 
to interference from prefixes and the preposition "zu". 
Furthermore, the .dictionary information associated with 
each verb would need to specify which auxiliary is required 
to form the perfect tense. More complex checks are 
required to automatically detect the substitution of one 
tense for another, particularly with the perfect where an 
extra word is involved. (Kote that the verb analysis is 
already designed • to delay its- decision whether 'a particular 
verb is indeed the finite verb of the clause, precisely 
to handle the case of an. unexpected (or missing) auxiliary.) 
A ratiier signif icaSrit effort is required to assign a semantxc 
interpretation to a interchange of tense once it' is 
detected. 
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Articles and Pronoung ; Some of my more recent ideas 
in error theory need to be applied to the pronoun analysis, 
.particular those dealing v?ith the representation of strong 
and weak correct. Similarly, the probability method for 
checking article inflections should be implemented and 
tested. More Hundane topics include article substitution 
analysis, which needs considerable refinement, inclufding 
some semantic consideration for certain cs^ses and pronoun 
forms relating to- the second person familiar, which must be 
handlcjd better. 

Negation ; 'The program should be able to handle 
negative sentences, preferably through an automatic 
mechanism. It should have internal knowledge of how 
to negate an arbitrary sentence-, as well as diagnostic 
capability. ' This will involve position criteria for the 
adverb "nicht" and rules relating to the use of the 
negative article "kein-"* • 

Word order ; The check oh the order of the ^sentence 
'field elemisnts must be implemented, comparing against both 
nbrm^ word order and the expected ordering. ? have • ^ 
addressed this topic to a certain extent in the main 
discussion of word order ( Chapter 8) . The problem is very 
difficult, involving a number of ioose3-y formulated -rules 
and appeals to semantic criteria; it may well require that 
additionsil information be provided as part of the sentence 
description. . • 
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Inflected Adnectives ; These will provide a profitable 
area for further work in canplex error analysis. The 
progrjamminfi will involva?ng attempts to match up the 
inflectional patterns shown by the adjective with thcsse of 
the noun and article, and all that against both what 
expected and what is grammatically correct. 

Additions to the General Analytic Capabilities: 

Detection and. recovery from misspellincs constitute 

.an essential part of further development. I am aw:>^^^; 
work 'on 'iiii£spe^..illsp^: in English. <ref . 18; ;Syi:ri9nds, 15^;c> 
and French <ref, 14-3; Scholl, 1972> involving consonant 
substitution and vowel elimination as part -of the matching 

'algorithm. Quite possibly siisilscr efforts exist fer 
German- I %h\vK ttet a moderatiEJ approach would proimbly 
suffice for this project, to catch such difficulties as the 
"sch" "phoneme, or the "is"-"ich" suffix interchange. 
Clearly some discussion with experienced German teachers is 
indicated. In terms of the program, however, I insist on 
attempting a maximum separation between inflectional 
changes and misspellings; ,1 will not have the misspelling 
routines interfering with the grammatical analyS; is, nor a 
grammar error detected only as a misspelling. 

Logic to- deal with the capitalization of nouns ^ and 
some pronouns would, represent a good addition. Some 
convention is needed for conveying the upiper and lower case 
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distinction on a teletype, of course; the search functions 
would need modification so as.. not to be distracted by the * - 
case-shifl^ indicator. As far aschedking is concerned, all 
the word analysis routines already call on a common 
termination function; that function could easily be 
expanded to do the necessary checking. 

Curriculum recGmmendations stemming -.directly from the 
analytic process siiould be made available. To clarify this 
mption, cdnsi-der tYmt the major portion of a grading 
ajsalysis report concerns what the student knows and doeis 
not know, information of direct interest to a teacher v;ho 
must decide which ctirriculum to present. There is another 
area in which the operation of the grader is pedagogically 
interesting, to wit, the limit.ations which two obstacles 

impose on the grader's ability to gather 'information: fthe 

■A ■ ' , . . . ... ■ j ■ 

inherent ambiguities in Genis^f*\ grammar and the shortcomings 

of the prograa' 's guidelines for the interpret3.tion of 

errors- - i 

The program is uniquely able to detect when it is in 
trouble; what it does with that informaiO-on is another 
matter, clearly indicating an area for future research. 
Currently, the weak correct diagnosis is the major ! 
current manifestation that the program has encountered a 
grammatical ambiguity; but not all of the inflectional 
checks (and none of the word order checks) correctly detect 
weak correct sit uaticmsv In addatibn to merely reporting 
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that a weak discriffiir^ation occurs, the proeraiii ought to be 
able to point out how a different formulation of the 
expected sentence would have avoided' the particular 
indeterminacy. ■ ' 

Consider a simple example, taken from inflection 
checking. The student is to produce a dative neutdr 
article, i.e., "dem", but instead has written "das". His 
mistake can be interpreted in two ways, either as an - 
accusative neuter (not a bad guess if dative is correct); or 
-a nominative/caseless neuter. As both have the sanle 
form, there is 'no\ay for the program" to be at all sure 
what the student had in mind. However, if the sentence had 
been such that a masculine - article was required, the 
particular inability to distinguiish nominative from 1 
accusative would not occur. Por- the prograjn.to make a 
comment to that 'effect, such as "try a masculine noun in ' 
order' to > clear up this particular ambiguity," should not^ - 

reouire a very great addition to its grammatical 

■ ■ ■ - 5 

capabilities. 

In the discussions of various techniques of error' 
interpretation for inflection, I described an algorithm 

* ■ - 

which would have the ability to figure out the appropriate 
cur:i^icular recommendations to get around grammatical 
ambiguities.* Implementation of that algorithm wou3-d seem 
a. good place to begin. Por word ojder, I h^ve not yet 
really addressed the problem of reporting- analytic ambiguity 

See Chapter '7, description of probability method.^ 



and the ways to -avoid it. The level of confidence given by 
the PHRASEDIVIDER. package reflects analytic troubles, but , 
those du§ to grammatical ambiguity are not separated from 
general confusion by the student. I have a few general 
ideas of how to proceed with the necessary work; J am 
confident, again, that^the additional capability will 
represent a fairly natixrai extension of the existing 
program.. » 

Additions to Increase the Program ^s General Awareness : 

The 'Search functions, should, pay more attention to 
conteSct. For instance, they sJriould consider whether a' - 

.preposition been seen, or if an expected combination of 

•■':V- ' ^ ' ■'■'^ 

ad'jective and nouji is present. ,Tp achieve such increased 

attention will require more elaborate preprocessing 
operations diiring Phase C4ie of the "analyses. For the 
program to be aware of the varipu|i.^^n textual considerations 
which will be , needed, a number of )data structures will have 
to be set up for a ^ije'cific sentence.^ Furthermore, each of 
the search routines will need modification to provide for 
handling additional parameters, and to introduce a certain 
amoiliht of in terpomaiunicat ion between the individual search 
functions- . - ' 



* See: discussioh of limitations in the word match 
algorithm. Chapter 3- 



DepeMent clause analysis is currently hampered by a 

• . " ' ' ' , . • 'X 

certain ya^eness -in thfe. setting of clause ^boundaries* 

Some, pf ^he dtfficiilty steins from the dejperral 'of the " . v 

.pronoun ^gearch until the very- end of the search: ..During J 

^ . * ' ■ ■■ " ^ — - ' 

the'*re^^ul'ar scanning of a clause, a correct instance of .an 

expected pronoun should 1:^ immediately claimed by the 

lappropriate f^anction, just as is \done for nouns and other' 

major words* In" any event,, even when deferral is ' ^ . 

necessary, the pro£;ram sijould l:>^\£enerally aware'^^T^^ 

•existenGe of unclaimed pronouns in the interior of the... 

clause, 'SO that it can tell whe.tiier to proceed past the 

last major. word of a clause diiring the pronoun search. 

'Note that if the<;Student's syntax is con-ect, there is no 

problem determining thie^dlause boundary;^ thie dependent 

clause'will. be, delimited on the front by the subordinating 

word and at the end -by -th^ -finite verb. - But the student 

may not be "borrect , in which Case a pronoun may be the last 

"word of the clause he Cr.eates^ ' - 

« ' - r ' "* -I ' ^ 

*■ 1 - . , V; - 

J ■ -I - _ • w ' ^ * " 

^ A major area for additional' vforlc^ concerns ''the 
analyzer's response td word presence errors: omission of 
expected words, addition ^ of • unexpected iextra words,, or "'. ■ 
substitution of one word for another* This topic? is a 
Pandora's Box, as it quic^y turns, intoia. question of 



analysis of arbitrary naturajt language, - a problem I'd - 
, rather stay away from at this point. jNonetheless^ there is 
quite a bit of room for improvement even within the general 

Q 

constraints of the present research. 
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Pure omission is probably the easiest of the three to 
; handle, as there is no need to.cope with unknown words and 
their structures; yet a meaningful interpretation of the 
omision may be somewhat tricky, particularly if what the 
student has produced does ha.ve an interpretation in correct 
German, Pure addition of words, on t^ other hand, most 
quickly becomes uhmanageable. Given access to a 
dictionary, the program might be able to identify some 
tmanticipated words and deal with them using a small set of 
heuristics- • 

Substitution is pe*: haps the most interesting of the 
three types of , presence error. It is strongly related to 
both omission and addition. When a -substitution occurs, 
the expected word will not appear,^ an omission; an extra one 
will be present instead, an addition. IJ there is any logic 
behind the substitution, ,|l;he infonnation about the expected 
word will still be partly applicable; e,g. , noun, head word' 

.; ■ i . ; ■ - , ^ . ■ - ■ - - - . ' . I 

■ - ■ - . !,- 

of an accusative object, etc. Pronoun paraphrase is 
probably best treated as a special case of substitution. 

When changes from the expected occur in the actual 
lexical words the student employs ^ the program must be 
prepared to respond with some diagnostic comment on the 
resulting semantie implications. ' This again is a v/ide open 
subject. Two observations, however. Pirst, if the program 
can identify the unexpected word, using a dictionary, then 
ErJc perform the normal inflectional checks relating to 



that word even without knowing "t^^ meayiing. , Secondly, the 

task at hand has oeen defined as the teaching of German 

" "" ■ ' - ■ . • 

graimnar, and detailed discussion of finer semantic points 

, ' ' . • * ■ ' » 

might well be ideferred to another time. ■ . - 

The program currently .detects missing words in one of 
tWQ ways. If a major word is missing, its WDUST entry 
will remain even when the end of the clause has been 
reached. .The absence of function and other words. which are 
the subject of directed search will be noted at the time of 
that search. The absence of major words is currently noted 
in a most cursory manner; an obvious expansion would be to 
follow the cross-reference pointer to the corresponding 
SDLIST entry so the progxam could at least refer to the 
entire clause in making its complaint- That point would 
also be the time to consider whether a substitution may 
have occurred; most likely the program will have to maintaiti 
a list^of linrecognized words from the main scajn for 
reference at this point. : 

All the SDLIST functions, at least those 
cdrresporiding to top-levei clause elements, would have, to be 
made sensitive to the prospect of being activated on a 
major-word missing basis, as well as the curi-ent major-word 
found condition « The case of pronoui\J)ar"apbrase is an 
example of the kind of complexity that must be handled: the 

* The Inevitable special case, in which either the new or 
old word has some grammatical anomaly, should be noted in 
that word's dictionary entry. ' , 

' ■ . , ■ ■ -■ , ■ • . . 
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SDLIST element would be activated first for thQ missing 
noun* Later,, if the pronoun were found, it wou|ld be ent^-rre^i 
again 'as part of -the pronotin ajnalysis* if the pronoun were 
nat^j>resent, then action appropriate to the omission of the 
original noun would be necessaryo Other substitution 
analyses will be equally, if not more, coEiplex, - 

^ Tor those words whose omission is noted during a 
directed search, a different strategy is needed. There as, 
of course, no problem detecting the absence. But becaLise 
part of the clajise element will be present in this case, 
the errorinterpretaticn may well' be more involved tte^i'i 
merely stating that such and such is absent. Yor instance,^ 
if both the .article and adjective are missing from a noun 
phrase, tiat absence should probably be treated as a single 
event, not as two isolated errors* The solution seems to 
require that the [SDLIST] analytic functions do iTiore than 
just activate lower level analyzers; when the lover- level 
work is done, the calling fiinction must delve into the , 
resulting data structure and determine what has taken 
place, assign an interpretation to it, and so forth. For 
example, the Noun Phrase function ciirrently just sets up a 
context for inflectional checking* Under the proposed 
improvement, this function woiild have the responsibility to 
notice other features pertaining to the noun phrase as a 
whole. Per instance, if -an expected article is not 
present, the noun, phrase function would decide why th^ 
article was needed in the first place and produce an 
appropriate comment relating to its absence. 



The same functions would appear to be the appropriate 
location for heuristics to handle detection and analysis of 
extra words. The middle "of a noun phrase is a good le^jal ' 
place for an adjective, for example, and the Noun Phrase 
function should do its best to accommodate the student who 
insists on /doing mo;re than he is told^. Perhaps the program 
sh^ild be' sensitive to cert^ain prob&ble errors , much- as 
this runs against my general philosophy of not looking for - 
specific anticipated errors in the grading, process. Thus 
in,/a construction in which one noun moddfies another, the 
^^English pattern may gain the upperTia^d : "Eringen Sie irir. . 
ein Glas von Bier" ("Bring me a gie^gs made out. of beer"!) 
<ref. 7.5; Kufner>. An extra preposition might well be 
anticii}a,ted- "in this and several other environments. - > 

^Creation of an In structional Program 

. : . : ■ ' ^ 

. \ . - ~ ■ • ■ ■ 

...... 

The grader was designed for use by an enlightened - 
computer tutor. ^Although the project is still ve:^y liir.ited 
in sco^e, the essential characteristics are how fairly 
clear, enough .to allow research to begin on an 
instructional system which Vfould use the grader s special 
analytic capabilities. I am not entirely sure what form 
such a, system would take. One of the desigrt criteria v;as 
to provide a grader which would be an asset to a variety of 
instructional systemis, rather than one rigidly linked t'o a 
specific instructional philosophy. 



0 
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I see the [computer] teacher as bein£ quite , 
responsive the individual student. A basic prerequisite 
of individualization is that the teacher have specific 
knowledge about thie student; probably the more the 
individiialization, the great ei: the amount of inforication 




needed. My. grader handles lar^e responses not just because ,- 
^ they are pedafiOGieally desirable, but also specifically to . 

speed up the flow of information from the student to the 
teacher i In this context, Th6"v^t' amount of diaenostic 
. inVormation takes c3n a new usefulness; rather than 
r^resentine absolute verdicts for immediate (or nearlj'^ 
ii5^diate) action by the teacher, the. crader reports may 
just produce, minor adjustments of the various ijaranieters in 
a model the teach'er might maintain to reflect the student's 
understanding, Curricular attention could be directed t'o a 
difficulty as necessary" (an arbitrarily comple^p condition) 

»> - ■ ■ • ■ - ■ ■ . - 

perhaps after a series of small errors, or a pattern of 
scattered errors with one final straw clinching a hypothesis 

\^learly a £preat deal of developmental effort V 

' the appearance of operatiorial instructor. I would hope 

that the resultant i)rogram i/buld have sufficient diversity 

and flexibility to i>pse an answer to the following eloquent 

c, criticism of mechanized iristructionc 

" The skilled teacher liias maniy objectives to 
work from. She is not concerned about 
getting them SL^ 

opportunity to do pBitial q^ ' 
They will reassign goals on the spot- The - 
reccmsideration of priorities is t^ 
important piii^chase that we make when we chose 
o skilled teachers rather than programmed ^ 

f RIC - material. ' <ref . 16; Stake, 1967> 
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Applxca-tions in Other Areas 

Close to home, the analjrtic techniques, of this 
proerajB should be applicable to the sradin£; of other 
natural lansiaa^jes. As noted in the introduction, this 
jjroject is very intimately involved with the specifics of 
the Gerinan language. Nevertheless, many of the general 
principles used by the grader have wider applicability. 
'Special consideration will be necessary, of course, to 
handle the i>articular features of each language; but much 
of the underlying architecture may be useable. Thus, for 
instance, French artiq;ies must agree with the noun in 
number and gender* Althoiigh forms and p>at terns are 
different from German, the general notion of guessing v/hat 
the stduent hdd in mind is still valid. Specifically, one 
of three mel/hods ( see Chapter 7) would probably be directly 
applicable, albeit with a different set of parameters. 

other Sub.iects ? , 

• In other subject domains, the grammatical analysis 
capQbili:6^^will be less pertinent than th.e general 
pliilbsophy of my approach to ;grading. Language teaching • 
has no moAppoly on the notion that grading should be done 
via ^^s-^^gram that ^ is competertt in' the subject domain, that 
cososideration must be given to questions of partial credit 
and multiple error s>v and that reasonable explanations may 
exist to account for the students efrors. 



For a simple example, consider a problem that arises 

It- - " 

in elementary music theory courses: a short series of notes 
is played for the student, who mvist write them down. The 
student is not expected to have absolute pitch, only 
relative pitch; the object of the exercise is to train his 
recognition of intervals and transitions between notes. 
Therefore I would expect a grader program to have at least 
a miniiiial ability to transform a response to a different 
key. v( For instance, if 

presented, t^ grader must be a^le to recoenize as' . 
equivalent "A E B", "G ej(i^ A" and'many others. ) 

To continue into the problem more deeply, consider^ 
the following difficulty. For simplicity, I will assume ' . 
for the moment that the student's first note will always be 
considered as correct (and use }it to indicate to the grader 
the required transposition of the whole response ) . The 
student may Specify the second note incoixectly, but set. 
the third one right. Clearly, a theory of how- he is going 
about his task is needed. If he is merely recording the 
interval between successive ncbtes, then he has made two 
errors: first to second, and. second to third. ^ 
Altemativeiy, if he is also functioning in soine global 
context in which individual notes are related to all that 
comes l>ef ore , then perhaps he" has. committed but a s£^g^ 
error: the improper specification of the s^ond note. 
Notice the associated questions of whether a single error 
should provoke a transposition of the^remaiiider of the • 
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response, and indeed whether there misl^t be an error 
lc>calized to the first note vhicl> would also affect the 
particiilar transposition performed. 

•Continuing the analysis a bit further, one misht 
consider the conceptual underpinnings of the problem. 



Perhaps the seouence of nd>tes is somehow related to specific 
harmonic intervals^ In that case, it v/Qiold be worthwhile to, 
check whether a wron^ reisponse coiold be 'accounted for by 
the hypothesis that the ktudeht has chosen a different but 
still appropriate intervial, -Other ' errors irisi^t be traced 
to the student not knowins the correct formulation of a 
particular interval. I . , 

In sum, clearly competence in the subject domain is 
-necessary but not sufficient. A comprehensive aJpproach to 
grading requires two other components^ attention to yar ioiAs 
aspects of error analysisi and a way of inte£;3ratins 
information , about the e^t^jected . response with the 'ability .to 
analyze the material standing alone, I have provided a 
working example of a grader for German grammar instruction, 
aaid hope that other projects will Home to apply what I ha.ve 
learned- for th^ir own subject areas. 
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Appendix A' 



A Brief Presentatioin of Some Portions of . Standard 
German Grainmar, with Particijlar IiDi)ort 
for Word Order. Analysis ' 



The Reference Gra mmar of th e German Language 
<ref, 9.4; Ledere3>-Schiilz-Griesbach> gives a quite 
straightforward discussion of i the hasic structure of a 
sentence: 

The PREDICATE is the most essential 
functional unit of a sentence, and includes 
all verbal, forms of the sentence as well as 
many verbal complements. It forms the 

SENTENCE CORE. 

, <ref. 9.5; sec. H110> > 

Sentences . consist of one or-more 
* GLAUSES , each of which contains SEMTEWTCE ; 
UNITS (words or phases) which can be 
recognised as, constituent members of the 
sentence. "These sentence units are placed 
into a greumatical and^ ^^1^ " 
tb one another ty meaiis of a predicate 

<ref. 9.6; sec^ H0G2> 

The' "^Germah •predicate" con;^sts of the verb and certain 
closely associated complements. All other parts of the 
sentence, including objects, (whiioh in English are casually 
part of the predicate) the subject,, and varioias modifiers 
are considered under thie classification Of "sentence 
Units". They aac^ grouped^ tc^ther into something called 
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the "Sentence Field", which is most stron^y identified as 
a description of physical location within; the clause. The 
sentence field is delimited on both ends by the split 

■ ■ • ■ ■ i ■ ■ . ^: - - J" . ' ■ • ■ 

predicate (whose parts are labeled VI and V2;, as shown: 



. Anterior limit ■ Posterior- limit 

I sentence field | 



"Hast du das Buch gelesen" 



<frgm |ref. 9-7; sec. 



~H022> 



The full declarative sentence has two other slots, 

an end field following the: second part of the predicate, 

arid a front or* 'prefield before, the first part. Tiie end 

field contains one r.entence. unit whi^ch must be one of 

certain special types'. The pre-field is usually assigned 

excessive importance by non-German;^' because of our strong 

association of the posi tion ^ before / the verb with the 

grammatical subject of the sentencp. In German, the 

pre-field has very little syntactijc importance; it does not 

even exist in' many , types of clE.uses. When it is present, 

its function is mainly semantic link to what has come 

before) ; syntactically, it is treated as a trivial 

positional* transformation: I 

. JDECLARATIVE STATMENTS normally have one * 

sentence unit preceding the! first part of the 
predicate. This unit in the prefield of the - 
Q sentence fulfills the functidn of a. contact 



member ,, with the preceding sentence of a 
connected utterance or discourse, or serves 
to arouse interest. by referring to a 
particular concept- It neither lose? its • 
^anunatical and logical function within the 
sentence, nor its claim to its normal - 
position within the sentence field, to which , 
it reverts if the sentence assumes a 
different form. V • '. 

<ref. S.S; sec. H051> - 

The movement of V1 to the position folldWing V2 
(the famous verb-last order of relative clauses) is. . 
similarly treated as a simple positional transformation- 
with 'no. further .significance on the -sentence stiructure.* 

I have one change in the Lederer-Schulz-Griesbach 
scheme, in their particular distinction between predicate 
complement and predicate.. modifier. 1 will not deny that 
the notion of predicate complement is valid for the German 
language, nor the right of the teacher to present this 
concept as he chooses. For the grading purposes of the 
program, however, I would observe: 

1. The distinction is subtle, and thus bne may expect 
that a student will fail to malce it properly, 

2, Because the distinction 3:s subtle, it is easy to add 
it as a special extra consideration for checking x 
clause-element positioning. (Even the Reference Grammar 



* . This is not quite accurate," as there Is sometimes . 
interaction between VI arid p^i^^s of V2, but that is a minor 
"consideration entirely intemal\to the predicate. 



tends to consider predicate COTiplements along with sentence 
elements when discassin^ word order <ref. 9.9; sec. 

Some compleiaents er^ definitely tied to the verb,, 
and appear with it in a dictionary.; the "separable 
prefixes" are a good example. They are further 
distinguished in that they cannot appear in the prefield; 
the example s20a is not possible in correct German: 

s20- Der Zug faengt an. (/^^^ train starts up) 
's20a. * An/faengt der Zug. ' ^ 

( inarks an incorrect sentence) . 

These complements clearly do belong with V2, as^ 
they are quite closely tied to it structurally. A large 
number of the predicate comiiLeDients, howeyer* are less 
strongly attached. 

•; ■ V . . ■■ ■'/ -. ^ ■ ■■■ ' .. . - \- • 

s2i. Hans wohnt in Berlin. (Hans resides in Berlin. ) 

s21a. * Hans wohnt. (Hans resides..) 

s22. Hans arheitet in Berlih. (Hans wofks in Berlin. ) 

s22a. Hans arbeitet. (Hans works. ) 

- - " ^'^^ ■'•J ..." . • , ■ " ■ 

s23. In Berlin wohnt er.' .(He resides in Berlin. ) 

s24. In Berlin arbeitet er. (He works in Berlin.) 

S21a is not a complete clause, and so the prepositional 
phi«se. "in Berlin" is formally considered a predicate 
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complement. S22a, on the other hand, is complete, and so 
the same phrase "in Berlin" is considered to add 
"non— essential" information and is considered a predicate 
modifier, which ;ls part of the sentence field. Both s23 
andoS24 are grammatically correct, however. (s24 may be a 
bit unusual, but it is grammatical.) 'FoUiowing Kufner 
*<ref* 7»4>i I would like to use an extended notion to apply 
to both sentence ^ elements and loosely— bound predicate • 
complements* ^ / « 

The particular predicate complements which I want 
to clasgl/y as clause elements can easily be confused with 
-the modifiers • Thie Reference jGraimii^ the 
narrowness of the dividing; line, and offers as a test the 
criterion of necessity (as discussed above, s21 vs s22) 
<^^f* 9*10; sec. ii233>^ The predicate complement is by no 
mieans the only required element in the sentence; the 
primary object is equally indispe^sible: 

s25: Ich kaufe ein Auto. (I buy a car.) 

s25a: * Ich kaiife. (I buy..) 

s2^: Ich bitte das Brot. (I ask for some bresad.') 

s26a: . * Ich bitteV (I ask ..) * 

.S25a and s26a are not complete clauses, because they lack 
objects. Since an object may be a preposltipnal plirase. in 
.German, th6 distinguished position of the predicate 
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complemert "becomes further ercxied. Comi^re the three 
examples given so far: ^ - 

s21: Hans wohnt in Berlin. (Hans resides in Berlin.) 

s22: Hans arbeitet in Berlin. (Hans works in if-erlin.) 
s26: Ich bitte ura das Brot. (I ask for some bread.) 



Based on the above discussion, I claim that the 

distinction of predicate coinpleiaent is not a nia.ior 

structural one. The dis^-nr 'ishine features are tenuous 

even for a formal theory; axpect a clear distinction 

from students in an elementary course is unrealistic. I 

feel tliat incorporatirie a complexity which is not reflected 

'in the behavior of the student represents a hindrance to 

the error-oriented analysis. 

- . . ■ ... . /■ 

I have, therefore, adopted a' simply scheme than 

that of the Reference Grammar. I u^e^alner's teiJii, 
"clause element",- which he defines as any constituent that 
can occupy the prefield <ref, 7.5>i In particular, clause 
elements include all the Reference/ Grammar's sentence 
elements, as well as the less closely bound predicate 
cpmplenient. I make up for the change by insisting that the 
distinctiori between predicate coafclement ' and predicate 
modifier be kept for determining [the proper word order. 
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Appendix B 



Table Used in Inflectional Analysis 
for Intent-guessing Method 1 



Input: Article ending + noun inflection + erpected value, 
Values: Intent-guess, with c.onfidence level. 



DescriBtion : 

Data which will conie from the student's response: 

actual article ending — "ending" 

noun stem/ending — "noun:" + "s" or "p" 

/ "s" — (singular) no change from stem form 

/ ^'p" - stem arid/or ending changes, which: ' 

"p (-n)" - include '-n' ending 
"p other" - no '-n' ending 

Data yhich will come from descriptive information supplied 
to the program: " 

proper case - — "expected case" 

proper gender — (just indicated as "m", "n", "f"; 

or if no discriinins;tion is needed 

proper number — not considered in this table 



ERIC 



- 186 - 



Data inherent in the table: i 

Prom grammar and past performance: 

slots for whicli the ending is correct 

— given following the .ending. Format: 

. < case ntimber gender > ^ 

' is used when any value will do, as 

in plural where there is no gender 
discrimination. 

a list is used if more than one value 
is appropriate, as "(nom acc)" for 
either nominative or accusative. 

comhinations which the student may have intended; 

includes all correct slots, plus plausible 
additions to accomodate likely errors: 

• "possibilities*' for each case heading 

*** probability/confidence that a given combination does 

indeed represent the student's intent, as 
^ovm by his performance 

— ; entries in the chart, given as: 

"veryhigh" 
"high" 

"med" (medium) 
"low" 
blank for "none" 



+ 



Actual Table : , 

On following pages, arranged by ending- 
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ending: -er 
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as in "der", "dieser", "einer" 



correct for: <noiii s jsl> <dat.s f> 

("ein-" forms only correct for <dat s f>; 
included here,-, will be checked separately) 



Nominative 

<nom s in> 

veryhigh 
high 
high 

med 
med \ 
med 

Accusative 

<nom s m> 

med 
low 



■Expected case: 

Possibilities: 
noun: s 

n ' 
f 

noun: p 
m 
n 
f 

Expected case: 

Possibilities: 
noun: s 

m 

n' 
f 

noun: p 

/ ■' ■ m ^ 
n 

/ f 



Expected case ; Dative 
7 ' • 

'Possibilities: <nom s in> 
noun: s 
- m 
n 

f .. 
noun: p (-p) 
m 
n 

■ f - • 
novnt p (other) 
use accusative p entries 



<nom p *> 



med 
med 
med 



<acc s" ffi> <dat s f > <dat p *> 



high 

med 

med 

med 
med 
low 



<acc s m> 

med 
low 



low 



low 
lev; 
high 

low 
low 
med 



high 
high 
veryhigh 

low ; 

low 

med 



med .■ 

med 

med 



<dat is -f> <dat p *> 



low 

m^ed 
med 
high 



ERIC 



Comment: The ,<acc s m> category probably should have been- 
<(iiok acc) s m>, to indicate blurring of the case distinction. 
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ending: -es as in "das", "dieses" 
correct for: <(nom acc) s n> 

Expected case: Nominative 

possibilitesi- <(noiii acc) s n> <(nom acc) p *> 
noun: s 

m high 

n veryhigh 

f high 

noun: p 

m nied ined 

^ raed Died 

jr med raed 



Expected case; Accusative 
noun: s 

use nominative entries comm 

noufi: p 

use nominative entries 



Expected case; Dative • 

noun: s / ^. . 

use accusative entries, with diminished confidence 

noun: p ' 
use accusative entries, with diminished confidence 
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ending: -e as in "die**, "eine" 

correct f9r: <(noin acc) s f> <(nom acc) p *> 

Expected case: Nominative 

possibilites: <(acc tfM) s f> <(acc nom) p *> 
noun: s 

m high med 

n high med 

f veryhifih — med^ 

noun: p 

m low veryhigh 

n low veryhigh 

f med high 



expected case : Accusative 
noun: s 

use nominative entries 



noun: p 
use noaiinative entries 



Expected casv.-: Dative 

* ' ' ■ ' 

noun: s 

use accusative entries, with diminished confidence 
noun: p 

use accusative entries, with diminished cnnfidenc^.. 



CoBBQent: built-in bias for plural will miss those nouns 
which do not have a special plural inflection. 
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ending: -en as in "den", "einen" 

correct for: <acc s i]i> <dat p *> 

Expected case: Nominative 
"•confused" 



Expected case: Accusative 
possibilities; <acc s iii> 



noun: s 



veryhigh 
high 
high 



Si 
n 
■f 

noun: p (-n) 

m1 ' ,high 
• m2 veryhigh 

n low 

f low 
noun: p (other) 

m 

n" 
f 

Exj€cted case: Dative-;- 
possibilities: <acc s m> 

nouii : s 



<dat p *> <dat s (m n)> <acc p *> 

low low ■ , 

low 



ired 
low 
high 
high 

med 

med 



<dat p 



oed 
sied 
flied 



*> <dat s (t-: n)> <acc p 



*> 



:ric 



m. 


high 


low 


■ med 




n 


med 


low 


^ med 




f 


med 


med 


low 


C 


p (-n) • 




veryhigh 


,9 




ml 


low 




* 


ni2 , 


Died 


high 






n 




veryhigh 






f ' 




veryhigh 






p (other) 










ID. 

n 




med 




low 




- high 




low 


f ■ 




high 




. low 



i 
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ending t -em as in "dem", "einem" 
correct for: <dat s (m n)> 

Expected case: Nominative 
• "confused" 



Expected case: Accusative 

possibilities: <dat s (m n)> <acc s ei> 

1 

noun : s 

m high med 

ri high 
f med 



noun: p (-n) 
use dative entries 

noun: p (other) 
use dative entries 



Expected case: dative 

possihilities: <dat s (m n)> <dat p *> 

noun:, s 

ffi veryhigh 

n veryhigh . 

r ^ high 

noian: p (-n) . 

m low high" 

n low . hi^H> 

f hiii' 



noun: p (other) 

m med low 

n med low 

f low med O 



o 
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ending: nil ^asin "ein" 

correct for: <(noin acc) s n> <noin s m> 

Expected case: Nominative , ^ ' , \ 

possibiliti'^^s: <(noin acc) s'n> <nom s (.m n)> <(nom accj 



noun: s 
m 

f Uow med low 



low , veryhigh 
veiyhi^b. low 



noun: p 

m 
n 



<(noin acc) 
low low Died 

low low Died 

f low low K.ed 



Expected case: Accusative 

-noun: s " * 
use ji,9jainative entries 

noun: p 
use nominative entries 




• X 



Expected case: Dative, 
noun: s . 

use accusative entries, With diminished. cent idence 

noun: p , , ' r.. ' 

ur^e acciisative entries, with diminished confidence 
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Appendix C 

FULL CHART OF PROBABILITIES, IOTEI>.T~GUESSIM} METHOD 3 

■ — Chart of P(intent[i]) for: all case-number-gender 
combinations. 

— Chart of P(form[3] I intent[l]) for alJ plausible 

endings, for all case-riuiiiber— gender combinations. ^ 

f <-v. 

Introductory coiiimeiil|s: 

aIi probabiliitaes are referred to the actual expected . 
slot. Because the expectation cannot te usefully described as 
a probability distribution, there is not a true conditional ^ 
probability for intent; i,^- it is not correct to consider 
P(intent[i] I expectsttion[k]) . The actual situation is quite 

close, however- For purposes of this analysis, the proper 

.* ^ ' , * 

approach is lo consider that the case-number— sender expectation 

■' ' ■ ■ ^ ' . , ■ ' ^ ' ' . 

specifies one page (^.n a tabulation of probabilities) on which 

the relevant intent [i] probabilities can be found. A separate 

psLQB must exist for each possible case-nuober— gender ' ^ - 

combination. I make two assyiaptions wi^ich simplify the 

construction of the table: ( 

1. The conditional probabilities, P(form[i] I intent[jj), 

for the various orthographic forms do not depend on the 

expectatiori. In other words, although P(intent[i]) depends 

on the expected values, P(forin[j] I intent[i] ) does not. 

For instance, if Jthe student actually intends <acc s in>, the 

probability that he will use "-en" is the same regardless of 

whether tha^ intend is correct. Therefore, only one set of 

form[j] probabilities, P(form[3] I , intent[i]) , is necessary. 
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2. The student's l)erf crmajice on each graniEaatical 
category will be independent ; i.e., whether he can use. 
the correct gender for a noun does not depend on the case 
in which it will be used. ' This assumption allows the . 
intent probabilities to be constructed- simply as a product of 
terms for each grammatical ^category. For example, to 
compute P(<acc s m>) .with an expectation of <nom s m>, 
simply calciaate the product of*+'he terms for the individual 
grammatical categories. ; 

— case: should be nom, intent is nom; use P(accnom) 

— n^jmber; shoi.ad be s, intent is s; use P($amenum) 

— gender: should be m, intent is m; use P(saiiiegen) 

So P( <acc s m>)"~= P(apcnom) * P(sainenum^) * P{samegen) 
for the expected <nom s m>. 

The following is lhe complete chart of probabilities needed 

for the" analysis; I have gi|ven both symbolic terms and ad-hoc 

i ■> ' 

estimates of reasona' lumerical values. 



P( intent) for specific valubs : 



CASE: 
Expected 
nominative 



accusal, ive 



dative 




Intended i - ^ 
,nom: Pfnomi^om 
acc: Placcnom 
dat: P(dathom 

nom : P ( n omacc ] 
acc: Piacdacc^ 
dat: ■P(datacc. 



= .04 

= .90 
= ;G6 

nom: PCnomdat) = .05 
acc ; P( accdat ) = « 01 
dat: P(datdat) = .85 



( 



Rir 
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GENDER: 

intent same'^'as expected: 
intent different: 

WUKBER: 
"intent . same as expected ; 
intent diff'srent: 



P(saEe£:en) 
P(diff£en) 



Pfsamenumb 
P(diffnuip.b 



/ 

15 

,65. 
.15 



P( form I intent) for case-gender-^iimber c6abinations: 

" (no gender distii^iqtion in plural) 



INTEIiT 


FQBli & PROBABIUTY 




nOEinative 










masc 


er. 


. 95 






neuter 1 


es. 


.95 ^■ 






fem 




.95 






plural 


e, 


.95 






accusative 










masc 


en, 


.85 


eF, -IP 


em. 


neuter 


es, 


.95 






fem 


e, 


.95 






plural 


e, 


.98 






dative 










masc 


em, 


.85 


en, «10 


er. 


neuter 


em, 


.90 


en, .10 




fem 


er. 


.95 ^ 


e, .05 




plural i 


er., 


.85 


em, .10 





.0^ 



.05 

'05 



(In a nvunber of instances, I could not come up with a plausible 
wrong form for a particular line of the table, even though 
there is clearly a finite probability that the student will 
produce a wrong answer. It is for this re^ison that the sun. 
of the probabilities is not always 1.0, ,to acknowledge the 
existence of an unspecified "other" form.) 
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Appendix D ^ 
Detailed"^ Lescritpion of Verb Inflection Analysis 

Verb Gndin£ analysis includes three inilections: 
■y^^^^^^ubject, verb stem^ and verb ending. The analytic 
iStrategy must first determine the £xaimtatical slots 
represented .by 'each inflection; then, various 
comparisons can be made. 

' Stem: 

The stem infloction is compoised of the forfis 
taken by the stem vowel and stem consonant- The 
characterization is based on l.ie follovin£ scheme: . 

"I" - if the form is the same as the infinitive 

forEi of the verb; 
"s235" - if the form is the same as tliat taken/ 

by the verb in 2nd Sr 3rd person present ; 
•»*" - if the forift is the same as the infinitive 

and the verb does not normally show a 

distinction in this category for s23; 
"X** - otherwise. ^ . 

J Having arrived at a characterization on this scheme, 

the pro-am follows a simple decision tree to arrive at 

a composite specification for the stenii as a whole: 7 

»«« _ if both stem consonant and stem yowelxare "*'^;/ 
t.j« _ ii* both are "I", or one is "I" arid othW "*", 

or stem vov;el is '*!*• and stem consohaht "X"; 
"s23*! - if at l^t onfe is" "s25" and neither \is " I"; , 
^ "C" - (Confused) otherwise.. . - 

Pro5^_„th'e ccanposite specification, ^the next step, 

is a ii table reference to obtsuin the corresponding 

(^"et of slots. Verb types are specified by sin£le~bit , 
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nunbers (in octal); if more than one t:>'pe applies, 
the corresponding hits are "OR"ed together. For mtiltiple 
possibilities other than type, separate slot terns are 
employed- . 

« - <*,* 7>^ (all categories, verb types 1,2 and 4); 

. I - <* p 77>, (plural, all verb types), : ' ^ 

<1 s 37>, (1st singular,, all types except 4G),„ 

<2 s 7>, (2nd & 3rd person singular, verb types 
<3 s 7> 1,2 and 4) ; 

s23 - <1 s 40>, (1st person singular, type 40 / modals), 

<2 s 70>, (2nd & 3rd person singular, verb, types 
-<5 s 70> 10, 20 & 40 / all those that charge). 

ENDING: 

The ending, if recognizeable, represents graimnatical' 

slots of the same ^.composition as the steni. Again, multiple 

verb types ar^ given as a union of bits- The oiily verb type 

distinction really present here is whether coverage includes 

modal auxiliaries (type 40). The ending table is: 

Kull ending — <r s''40>/'<3 s 40> 
«_e" J- <i s 37> 
..„t" — <3 s 37>, <2 P 77> 
"_en" <1 p 77>, <3 P 77> 

-eDMPARlSQJl--PPERATIOWS : 

The first coiaparison operation constructs the ? 
steia/eTxiing specification- The program evaluates all 
pairwise comparisons between the set of stem slots and the 

set of ensfirig'iLots. If the values of > b^^ person and 

" ■ ■ \ . ■ 

number of the two' slots match exactly, a resultant term 
is generated with those two values and ah intersection 
figure for verb type. The, latter is the logical "AND" . 



Rir 
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:/■ ■ ■ ■•■ 

of tha^ two fi^ureis, representing .a set intersection as 

the type number^ are associated'" with individual bits. ^ ' 

For the resultant person arid number terms,, is eliminated 

>f possible. For example:.- * • 

; <2 s 70> with <2 s 77> gives <2 s 70>' 

(person, number match; 70 ."MD" 77 =' 70) 
<2 s 70> with <3 s 77>^ gives UIL 
(person mismatcli) 
' * 7> with <1 s 40> eives <1 s'0> 

(•jperson okay — ^matches anything;, use 
. non-* value as resul.t* Ditto for number. 
Type, 7 "AND" 40 = 0 [null] ) 

The last example stiows a resultant slot which applies 'to 

a null verb type* It is carried along to introduce 

slight ^p^eater flexibility into the diagnosis. 

The subject characteristics are reduced to a 

single slot, if necessary, by a quick check with the 

verb endings- Then three corci)arison terms are generated, 

♦ 

matching the subject against the two verb characteristics 
independently and against the composite of both: 
subject /stem, subject /ending, and subject/stem/endir^g. 
The match criteria are the same as above: exact match" 
for person and number* (Wo verb type is associated with 
the subject characteristic, so there i's.no screening on 
that term at this step. ) 

Based on the various composite characteristics, 
the program can make quite detailed error comments. 
Here are a few examples of output from the j)rogram:' 



ERIC 



^ RUN 1 (abbreviated) 

Description £;enerated by, human expert: 
(STMT (SJ ICH) (V'ERB SEHEN)) " 

SliTEIJCE IKPUT BY STUDeJT 
ICii SEflE 

mi PROCEEDIJ^'G WI^K ANAI.YSIS 



.......... Verb .ending check. ,. OK! . .• ■ 

Phrase division done, sentence type report f 

Final verdict is FV-2 , level of confidence: VllRXHIGl-] 

SujT'ffiary Coniments ' . ' - 

Clause type correct, (FV-2) 

— ^ ^ — FJ^D' OF FJUN — ^ • 

. 2 RUN 2 — — — (abbreviated) 

Description generated by human expert: • 
(STI^T {SJ ICH) (VERB SHM)) 

SENTENCE INPUT B):^ STUDENT ' 

'ICH Sim . ■ ( - 

" ■ 'i . 

. .'. . ' VER.B SEHEN . . APPEAR'S AS ■ SIEH ' , r 

-VERB DOES.KOT SHOW THIS Il^REGULARITY , Should be 20 is^jO 
■ ' ■ •. ERROR IS IN SrEI-4 ' • . ■ 

ERROR IS IN FINDING Should be E is *T* 

Summary. Comments 

Clause type correct (FV-27 



_ '. — EJ^]li.3 — (a'btreviated) 

^^Description generated by human experts ^ - , 

ISTWT (SJ ICH) (VERB SEHEN)) 

SElflTElicE lUPUT BY STUDMT ' ' 

, m SEHEN, - . ' 

, •< 

" PN SUBSTITUTIO.r;'.. Should be ( V »1 S *• PER) 

■is ((.3 NOM S K FFR) (WORD ER) ) 

:VERB SEHEN APPEARS AS SEHEI'i 

' - aWFINITIVE PGRJ'l USED . / 

' . ' ■ - 

Summary Comments 
Clause type correct^ (PV-2) / 



OF RUri 



r- RUI^ 4 : — ■ — (abbreviated-) ' ^ 

[note change in expected sentence] 

Description generated by huinan expert: 
(STMT (SJ E6j (VERB SEHEN)) 

. SEKTENCE INPUT BY STUDENT ' ^ ' ' 

SIE SEHEN. 

PN SUBSTITUTION.. Should be (3 JVOR S U PER) 

is ( (5- NOM P * PER) (WORD SIE) 

Verb ending check... OK! ' , 

Summary Cocnnents ■ 
Clause type correct (FV-2) ; * 

— — END or RUN — 
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The . following:, table' gives a, fairly complete presentation 

of thefin^ analysis algorrthm: which kinds of errors 

are, .covered and' how they are detected. To save sj^ace, 

I ^lave employed a few ahbreviaitions in the table: 

SE - stem/ending characteristic slots 

.JS - subject/stein , " 

JJi] - subject /endins - ■ . 

JSE - suhject/stem/ending- ' ' 

SC - stem consonant ' . ■.. ' 

SV - stem vowel ' v " 

(verb types associated -with specif ic . character is tics) : 
. JS type, JEtj'-pe, JSEtype, Ctype (Correct type). 

"bad XXtype" means that the logical "MD" of ^^that type 
with the correcl: type, yields zero. 



ERROH TYPE 



INDICATION 



COMl-iEM-S 



general 



wrong type 
verb 

stem error 



ending error 



JSE nuHr -or 
JSEtype =0 

bad" JSEtype 



JS null, or 

bad JS^ype 



ste'ni-spec = *'C" 



SC-spec = "X" 
SV-spec =. "XV 

JE null, or 
bad JEtype 



"verb ioria err-or" 

"missed an irregular it\" 
or "extra irregularity" ' 

"error is in stem" 

if stem-spec = "C"; then add 
"stem type 'confused', 'maybe 
improper formation" 
"stem confusion,' though ' 
grammatically correct if 
assumed to be s23" 
"incorrect formation of SC'J 
"incorrect' formation of SV" 



"error is^ in ending" 



no errors 



no error flag "verb ending -check .. OK" 



; . Appendix E 

f 

Exact Flow of PHRASEDIViDER , 
the Algorithm Which Detenu ines Clause Type 



The presentation of multiply-cascaded conditignale 
can be di'fficult to follow. I have chosen an informal 
scheme in which indentation is. used to indicate. the scope 
of the condition., Purthermore, the condition is given .by 
' merely stating -the boblean* condition, (which may be a 
Single variable). I have tried to avoid' if - then - else 
if — then — else type structures, rather repeating some of 
the condition each time; parentheses are sometimes used 
when a condition appears which the program actually 
expresses by the structure of the conditional rathisr than 
an explicit ^check. * , ' 

The innermost of the conditions is followed by a 
colon; the text after that refers to the conclusions 
appropriate at that" point. 



PHELIMIIiARY process™ : ' > - 

The clause is divided into sections delimited 

by the verbs: 

one verb: '-"A" — elements before verb 
* "B" — elements after verb 

"MODI" indicator set "true" . 

two verbs: "A" — elements before first verb 
•f ■ ' ■■ I'git — elements between two verbs 
"C" — elements after seco>]d .verb 
. . "FVDI" indicator "true" if finite verb 

preceeds dependent infinitive. 
"DirV" indicator "true" if dependent 

infinitive preceeds finite verb 

set "LA" to .-the number of elements in section "A" ' 

-(mnemonic: Length of A) 
set "LB" ^nd "LC" likewise (IC 0,if C is empty) . 

i • ' ' . - " 

MAIK fXOW: 

1. (Fy-1 cate£ories) • ■ ' 

lA - 0 (finite verb is very first eleEient in clause) 

then: Conclude PV-I with coni 'iddnce "very high" 

(henceforth written- 'FV-I, ''very high" ) 
: Go to "check" step. 



'2. (i'V-L ciate^ories. .first a general, all-encompasSin 
^ ' , conditional; then specific checks; 
PIPV- ^ (inverted verb order, or 

or l^ODI and LB - 0 (finite verb only" and at 

end of clause, or 
or rVDI and LE = LC = 0 (non- in verted verb order ^ 

■ ■ ' but both verbs together . 

at end of clause,, or 
. or Rel. V/ord present and' ( specie! category for slightly 
NODI and LB = 1 • (distorted forms that may , 

* or PVDI and LB = 0 : v , (still be relative clauses./ 
and LC = -1 • 



then- * (now dlsentajn^e various possiLilites) 
A Rel. Word missing 
DIPV ; 

LA - ^: rV-L, ••DediuE" 

LA > 1 : FV-L, "high" 

not DIKV / - 



LA = 1 
LA = 2 
LA > 2 



FV-2, ."high" 
PV-2, "medium" 
P.V--L, "mediuin" 



' DIFV and Rel. V/ord or Question Word present 

LA = 1 : PV-L,- "high" . ' 

. LA > 1: .FV^L, Mveryhigl:!" 

(-or else-) " , . 

Rel. Word present (KODI or *FVDI) 

LA = 1: . FV-1, "hi£}i" . 

LA > 1: FV-1, "veryhigh" 

Question word present (WODI or FVLI, vgrbs at end) 
LA'= 1: ■ FV-2, "high" 

LA > 1:- ■ Tv'-L, "high" 

(no ftel. Word or Question V/ord, and). 
DIFV and LB = G and LC neo 0: (possible DI in 

.V ^ Front Field, l'V-2) 

Add Dl to Front Field members ("A") 
'Increment LA by ; 1 
' Go to step 3 » FV-2 processing, 

■otherwise (DIFV, no Reli Word, odd placement) 
LA ^' r: FV-L, "low" 

LA > 1 : / FV-L,- "med" 

(FV-2 forms) • 

(no check, include everything not already claimed) 

' ' conclude: Fy-2, "veryhigh". 



CHECKING OPERATIONS (Quoted material here represents 

diagnostic messastes) 



End field choclc, check for either null or occupied by. a 
single eleEient which qualifies for that position. First 
decide 'which field — ' „ . , 

' KODI and type = FV-L: Check w?,U be .on field 
• F-VDI: Check will be oh "C" field- 

JDIPV ML TYPE = FV-2: Check wil 'be. on "c" field 




Let "^l" be the' End field ("B" or "C") and "JJ^:" its len£^th 



LE = 0: " '"fine ' : 

LE = 1 and."-E" element 
can be in end field: fine. ■ ■ 

otherwise: - Set "CFi^R" indicator ("(i-errdr"). for 

later reference; also, . 
if FVDl & LB/= 0 Record "DI position error, not 
/■ separategL from FV") 

else ' 
- 'if LB = 1. & ."B" element 
is Subject & either 

type - FV-1 or , ~ . 

type = FV-2 & Question 

.•Word is present: decrease confidence two 

■ steps, "E.F. error (En^aish?); 
else if LE = 1 : decrease confidence, 

"^end field error" 
else if FVCI & LB = 0: decrease confidence j " 

"DI not separated" 
otherwise: ' Record "not German" 

declare no confidence, 
• ' ■ "End-field too big" 



/ . 



GENERAL CHECKING, CONTINUED, 



if type = EV-L, 
•DIFV ^ IS neqO: 

FVDI: 



decrease .confidence, 

. "elements "between PI and FV" 
decrea^^e confidence, "EV proceeds • 
infinitive in EV-L"- 



if type = Fv-2i ) ■ . 

Rel, Word present: decrease confidence, "Rel word present" 
cheek LA (siiatement should have exactly -one element 
in .front field);" 



LA = T 
LA >1 & DIEV; 



LA > 1 & not,. DOT, 
LA- =2 & not CERR: 



LA > 2 or CERR: 



LA > 1 and not DIEV 
and Subject in .'^A" 
and no Object in "A" 



^ine;^ - 
declare no confidence, 

"Eront Field too big 
. .for Df in E.E. EV-2" 
Front I'ield. error indicator set 
decrease confidence, "two 

member front field" • 

decreasti confidence two ^eps, 
' "E.E. overly " large" 



decrease confidence, ^ 
. "Possible. English 
. pattern in E.E. error" 



SEPARABLE PREFIX CHECKS 
(omitted if confidence level is "none") 



Prefix position confused: 

type = FV-L • 

Prefix detached: 
Prefix attached: 

type = FV-2 

Prefix attached: 
Prefix detached: 



decrease confidence 

decrease confidence' 
increase confidence 

decrease , con f idence 
increase confidence 



end of PHRASEDIVIDER 



■■ Appendix F 

Examples of PHASEDIVIDEH Operation 

' ]' ' ' ' . 

The following represents direct output from the 

.two main word order fiinctions; PHRASEDIVIDER and 

PHKASETYPEREPORT. To facilitate a demonstration using - \ 

man;;!^ different sentence forms, I created a short test 

program, which allows a shtarthand specification of a 

sentence merely as a list of clause-element names. 

The particular abbreviations used are: ; . ' 

S subject' 0 - object M - verb Wodifier 
PV — finite- verb, DI dependent infinitive 

, PiW - relative word (subordi-nating conjunction, ^ • 

■ relative pronoun..) . " " 

QW - question word . .' . , / 

The following -additional abbreviations are a .little • ' 

less transparent. They specify "separable prefixes, and 

must indicate which verb is involved, as well as whether 

the prefix is real3.y to be conjsidered as attached "(a difficulty 

created by the shorthand representation). The following 

list Will- be-'-au/tmented. by explanatory comments immediately 

preceeding e^h example in which a prefix abbreviation occurs. 

■ ■ ■ • V ■ - ■ ■ - . • 

PVPFX - finitte verb^ prefix,' separated 
PVAPPX - finite verb. prefix, attached 
' LIPFX dependent oinfinite prefix, separated 
DIAPPX.- dependent infinitive prefix, attached. ' ' " 

^, . : /, . ■ ^ ; - ■ . ■ , . -» 

A last note~:» some error messages attempt to identify 
the, text of the offending- word. Because of . the shorthand 
used here, .no text is available, so. the symbol ".TEXT" will 
appearrinstead. - • 



SMTEI^CE: (S DI FV o' M) ' EXl'ECIED TYPE: i-'V-2 

■ - I ■ . 

V^Phrase division done, sentence type rei)ort: 

No confidence in sentence type, probably confiised 
last straw, was:. F.P. .too bic for bl in P.P.— pV-2 

■ Play by Play account oi' analysis: 

• Due to: INITIAL HYPOTfiFSiS ■ , PVT se^ to PV-2 

Confidence, in PVT sett ix-ig is; VtK-YrllGH 
Due to: F.P.' too bie for DI in P,P. — PV-2 
j^^^' . Confidence in PVT setting is: NOKE^ 

PD VALUE: CONPUSFD . " . 



SENTENCE:* (S O PV) ■ Ji^XPECTED TYPE: PV-2 ^ 

,i Phrase division done, sentence type report:., 

- Pinal vetdict is PV-2 , level of confidence:. MED' 

. ^ 

^ Play by Play account of analysis: 
. ■■■ Due to: -INITIAL HYI'OTHESIS , PVT set to PV-2 

Confidence in PVT settine is: VERYHIGH . 
' . • Due to: TWO MFPiBElR PPONT FIELD. " • • 

Confidence in PVT sett inr. is: iilGH 
Due to: Possible English pattern in' P.P. error / 
Confidence in PVT setting is: MED. 
». other comments: 
' Front field error (PV— 2) : 2 elements in P^.F. 



Summary Cominents 
Clause type correct (FV— 2) 



PD VALUE: FV-2 



; ..-SENTENCE: (S FV 0) ' . EXPECIFD TYPE: FV-2 

.......... Phrase division -done , sentence type report : ' . 

• Final verdict is FV-2 , level of confidence: VERYHIGH 

Play by Play account, of analysis: 
■ • Due to: INITIAL HYPOTHESIS , PVT set to FV-2 . : • 

• Confidence in FVT set'ting is: VERYHIGH 

. - Siaminary Comments 

Clause type correct (FV— 2) 
o ■ : , , ■■ , r ^ 

ERJCpd VALUE: IV-2 . ■ 



SEI\TE>jCE: , (RW PV 0 M) E/iPECTEQ/rYPF : FV-L • ' ' 

.......*... Phrase divisidr dor.e, sentence typ.e ^report: 

* Pinal [verdict is PV-2 , level of confidence: HIGH 

Play by Play, account gf analysis: 

Due 4;o: INITIAL HYPOTHESIS , FVT set ^tc PV-2 

Confidence .in iVT setting; is : VER-YHIGH 
^ Due to: RET^WOHD PRRSEI-.T . 

Confidence in PVT setting is: HIGH v' 

J**. Swmary Comuients 
Relative clause failure. . • 

iTb RELr-wcrd, probably ^not correct intent 

PD Value: fv-2 ' 



SEfJTEJJCE: (QW S 0 FV) EX^lclCIED TI^Ez -FV^2 • . 

Phrase division done ^sentence tj/pe report: 

Final verdict is FV-L , level oX confidence: HIGH 

Play by Play account of ansiysis: 
Due to: INITIAL HYPOTHESIS , FVT set to FV-t 
^ r Confidence in FVT setting is;. HIGH 

. , 

Svunmary Corments 

Apparent shift ^o PV-L / REL form for WORD question 

PD VALUE: PV-1 



SEKTENCE: (FV S 14 DI) EXPECTED ^YPE: PV-1 : 

Phrase division done, sentence, type^^ report : 
Pinal verdict is PV-1 , level of confidence : VERYHIGH 

.* ■ Play by Play account of aneilysis: . • ' 

Due to: INITIAL HrpOTHESIS , FVT set" to PV-1 

Confidence, in FVT setting is: VERYKIGH ' - 

.......... Summary Coiraneints . 

Clause type correct (FV-TJ 

PD VALUE: FV-1 , 'i' " •', 



SJMWCE: in s FY- 6 il" DI) EXPECTED TYPE: FV-2 

'phrase .division done, senter.ce tj.-pe report: 
Final verdict -is FV-2 ; level of confidence: MED 

Play by Play account of ana"^ 

Due to: IiailAL HYPOTHESI" o FV-2 

i ^ 'v. :-:. Confidence in IVT u. .^ERYfilGH 

DueVrt b ? TWO MEMBER' FROWT , 

Confidence' in /FVT S(jiL±ii^ is: HIGH 
■ Due to: Possible' Enel^i/eh pattem in F.F.' error 

* . Confidence in/FyT. setting is: i-J^ 
■ -." other coDunentsf // -'k . ^ . 
-r Front field error (FV^2)': '2 eleir.ents In F;F. 

■ ■^ ■ ■ ■ . / ■ ■ ■ ■ • - , . ' ■ / 

,,vi.,- ... Sumiaxy Coirjiienjts 

. Clause type correcit - (FV-2) ^ 

PD VALUE: i''V^2 . " " 



,SE1^TENCL'-: - (S FV Dl O.M M) , EpECTED TYPE:- FV-2: 

........... . Phrase division done/ sentence type report: 

Final verdict is FV-2 , /level of con£idence: HIGH 

■ Play by Play accourit of analysis: 

, Due to: INITIAL "HYPOTHESIS , FVT set to FV-2 

Confidence in FVT setting is: VEJJYHJGH^ 
Due to: DI riot separated ■ 

Confidence in FVT ".setting is: -HIGH 
• . ... other conments: . - _ 

DI position error,' not separated from FV 

' ' ■ ■ 

Sumiiiary Comments 
elapse type correct (FV-2) 

PD VALfiJS: 
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SEIJTENCE: (DI FV 0 M S) 



EXPECI'ED TYPE: FV-2 



Phrase division done, sentence type report: 



Pinal verdict is PV-2 ,,,, level,, of confidence: VERYHIGH 



Play by Play account of analysis: - ■ 
Due to: INITIAL HYPOTHESIS , FVT set to.PV-2 

Confidence in jPVT.se ttine is: VERYHIGH 



Siommary Comnents .. 
Clause type correct (FV-2) 



PD VALUE: IV-2 



note: in the fiOjl owing example, "FVPFX" denotes a 
separable pi^eMk associated with the finite' verb 



SENTENCE (S FV 0 14 FVPFX) 



EXPECTED "TYPE: l''V-2 



... Phrase division done,' sentence type report: 
Final verdict is FV-2 , level of confidence: VERYHIGH 

' ' ■■ ■ . ' ' '- - ' ■ ''' . ; ' ' " 

Play .by- Play account of ai^tal-ysis: 

Due to: INITIAIi HYPOTHESIS , FVT set to FV-2 

Confidence, in FVT setting is: VERYHIGH 
Duetto: SEPARAEL£r-PREFIX IS DEIACHEI) . .■ ' , 

, .C0nfiderlce in FVT setting is: VERYHIGH 

. . .^^liminary Cbinments: * 
Clause type correct (FV-2) 
Prefix separated, position is Correct 



PD VALUE:- FV-2 



note: .in- the following .ey.aiEple, TOBT' demotes a 
separable prefix associated with the finite verb; * 
It .should be considered-as not separated, even thou£ji 
here vi^itten separately. ; , 



SEl^TENCE: (S O PVAPFX FV), " ' EXPECTED TYPE: , EV-r 



« « • « •v,« * • ♦ • 



Phrase division done,, sentence type report: 
Pinal verdict is B' 2 .^ level of confidence:' LOW 



-Play by Play ar .uui. analysis:. . ■ ■ i 
.Due to^: INITIAi. HY? ^ESIS: ^set, to 

" f :Confidei,v.o EVT^setti-nfi is: V£:RY1iIGH 
Due to: TWO- MEl'^IBER PROIiT PIELDV^' . 

Gonfidence in- EVT setting, is: HIGH 
Due to: Possible En^l-is^ pattern 'in F.E... error 

■ 7 Confidence in TVT setting is: MED 
Due to: SEPARAELE-PREPIX HOT DETACHED. 

■ ; Conlfdence in PVT setting is:- LOW 
otlier comments: 
.:lrQnt,. field jiTOT^^i^ 



*.* - Suminary. Comments 



Relative clause failure,., : ' 
no REI-word, probably not correct intent 

■ ■ . f) 

Error in finite verb separable prefix, must be 
/ separated and at end- of clause in P\/^2' clause 



PD VALUE: FV-2 
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note: iri the 'following example, "fVPl'X" denotes a 
sex^arable prefix associated v;ith the finite verb 



'SEi^iTENCE: (^. 0 FV PVPFX) EXPECTrXi/iTPE,:, a 

Phrase division' dor.e, sentence tj^pe report:. • 
Final verdict is FV-2 , level of confidence: HIGH 

Play by Play accoun"t of analysis: 
^ Due to: INITIAL HYPOTHESIS FVr set to FV~2 ^ 
Cor 'ence in PVT setting Ig: VERYHIGH 
D- IIBER FRG^^T FIEI.D. 

Cuxi. ^.u^nce in' F.VT setting is: HIGH 
• Due to: Possible Engliish pattern in F^.Y.. error 
Confidence in FVT 'setting is: I-xED 
V ..Due to:' SEPARAELE-PREFIX I£ DEa'ACHFiD . 

Confidence in FVT setting is: HIGH J" 
. ' .,■ othoj* comments: . > 
.V Front field error (FV-2) : - 2 elements in F.F. 



, - "Relative clause failure. . . 
\ . • \ no. REIr-wordy probably not correct intent- 

Prefix separated, position is Correct ■ — • • 

- PD VALUE: Fy-2 , ' I 

— note: in the f611owi*ig example, "DIPFX" denotes a 

. .separable prefix associated with the dependent - . 

infinitive. t^' 

^ ' ^ ■ - ' . , ■ . 

■SEM-ENCE: ' ( S FV 0 DlPi"^ .• M DI j r • EXPECTFJ). TYPE : FV-2 

Phrase division done, sentence type report: 
" Final verdict is FV-2 , level of confidence: HIGH 

Play by Play account of analysis: v 
' - • / Due to: INITIAL HYPOTH&IS S ^'-VT set to FV-2 

Ponfidence ijn FVT; setting is: VERYHIGH 
" Due to: Separable-prefix confu 

- ^ 'Coipfidence in FVT setting is: HIGH 



4 



.......... Suniinary Comments : ' ^ 

Clause type correct (FV-2) yi 
•■ Prefix check .for verb^ "TEXT" ( DINF) FOLLOWST: i , 
■ ■ Position wrong (confused?) .. separated and before vej^b 



PD VALUE: FV-2 
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note/ in the following example, "DIPFX" denotes a 
separable p-efix associated with the dei>e:ndent 
ir^initive. , - ' ' . 



SET^'ENCE: (S IV C DIPPX DI) EXPECl'f^D TYPE: FV-2- 



. Phrase division done, sentence type report: . 
Pinal verdict is, FV-2 , level, of confidence: HIGH 

Play by Play account of analysis 
Due to: INITIAL HYPOTHESIS , FVT set to FVc-2 
*^ - Confidence in FVT setting is: 'VERYHlGH 

Due to: Sei^rable--pr iix' confusion 

• Confidence in FVT setting is: HIGH 



« * • *' 



SujEmary Comments . 
Clause type correct (FV-2) 

Prefix check for verb 'TEXT" (l3lWF) FOLLOWS: 
Position maybe okay*. 

(imined. before verb, but sep^irate word 



PD VALUE; FV-2 



note: in the following example, "FVPFX" denotes .a 
separable prefix g^ssociated with the finite vei^b) 



SEIJTENCE:^ (RWO M FV DI FVPFX) , E>GPE(n'ED TYPE: FV-L' 

Phrase division done, sen tehee: type report: 
Final-yerdict is FV-1 , level of conridence: MET) 

Play by Play account of analysis: . 
^ ' Due to: INITIAL HYPOTiiESIS -, FVT set-.to FV-L / 
/ • • Confidence in FVT setting is: VERYKIGH 

i r Due to: FV preceeds DI in FV-L clause. 

Confidence in FVT setting is: HIGH . 
' Due to: PREFIX IS SEPARATED 

Confidence in FVT setting is: MED . 

Summary Comments 
^ Correct, verb position for relative clause 

^. Error in prefix, must hot be ' separated when 

'verb is in fy^l position^^^ "^^^ 

PD VALUE: FV--L ; V : : ' '-^^ ■ f^^^^^^^ 
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note: in the following example, '^DlPi'X'* denotes^a 
'separable prefix associated with the dependent 
infinitive^ 



hmimcET <RW o m dippx, di rv) expected type: pv-^l 



Phrase^ division done, sentence type report: 
Pinal verdict is PV-L , level of confidence: HIGH 

,. ■ . ' » 

Play by Play account of analysis: 

Due to: INITIAL HYPOTHESIS ,'FVT set to FV-L. 

Confidence in PVT setting is: *VeRYHIGH 
Due to : Separable-prefix confusion 

Confidence in PVT setting is i HIGH 

...Summary Comments 

Correct verb position for relative clause . 
Prefix check for verb "TEXT" (DH\T) FOLLOWS: 
Position maybe okay.. 

(imraed. before verb, but separate word 



PD VALUE: PV-L 
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